Web Crawl solve the garbage problem

solve:
Import gzip decompression method
import urllib.request
from io import BytesIO
import gzip

url = 'http://www.mzitu.com'
response = urllib.request.urlopen(url)
html = response.read()
print(html)
buff = BytesIO(html)
f = gzip.GzipFile(fileobj=buff)
res = f.read().decode('utf-8')

print(res)

  

Guess you like

Origin www.cnblogs.com/wumac/p/12048583.html