BeautifulSoup web crawler

Reading from an HTML file

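from bs4 import BeautifulSoup

html_doc = "file path"  # replace with the path to your HTML file
# Open the file in a with block so it is closed automatically (UTF-8 is assumed here)
with open(html_doc, "r", encoding="utf-8") as html_file:
    html_handle = html_file.read()
soup = BeautifulSoup(html_handle, 'html.parser')  # choose the parser
print(soup)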
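
The same steps can be wrapped in a small helper. This is only a sketch: the name load_soup and the UTF-8 default are assumptions for illustration, not part of the original post. It relies on the fact that BeautifulSoup accepts an open file handle as well as a string.

```python
from bs4 import BeautifulSoup


def load_soup(path, encoding="utf-8"):
    """Parse the HTML file at `path` (hypothetical helper; UTF-8 is an assumed default)."""
    with open(path, "r", encoding=encoding) as html_file:
        # BeautifulSoup reads the document from the open file handle
        return BeautifulSoup(html_file, "html.parser")
```

Calling load_soup("file path") would then return the same kind of soup object as the code above.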

Reading from a web page

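from bs4 import BeautifulSoup
import requests

url = "http://www.cnblogs.com/j-c-y/p/11129345.html"
response = requests.get(url, timeout=10)  # timeout so the request cannot hang indefinitely
response.raise_for_status()  # raise an error on 4xx/5xx responses
page = response.text
soup = BeautifulSoup(page, 'html.parser')  # choose the parser
print(soup)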
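
Printing the soup only shows the parsed document. As a hedged sketch of what can be done with it next, the example below pulls the title and the link targets out of a soup object. The sample HTML is made up for illustration so that the code runs without network access.

```python
from bs4 import BeautifulSoup

# Inline sample HTML (made up for illustration) so the sketch runs offline
sample = """
<html><head><title>Demo page</title></head>
<body>
  <a href="/first">First</a>
  <a href="/second">Second</a>
</body></html>
"""

soup = BeautifulSoup(sample, "html.parser")
title = soup.title.string  # text inside the <title> tag
# Collect the href attribute of every <a> tag that has one
links = [a["href"] for a in soup.find_all("a", href=True)]
print(title)
print(links)
```

The same calls apply to a soup built from a file or from a downloaded page.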

Origin www.cnblogs.com/j-c-y/p/11454855.html