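import re
import requests

# NOTE: `...` marks string literals and numbers that did not survive in the
# source. The if/else layout in mulu() is inferred from the garbled text.
target = ...  # base URL of the site


def zhengwen(url, name):
    """Download one chapter page and save its cleaned text to a file."""
    url = target + url  # build the full page URL
    content = requests.get(url).text
    content = re.search(..., content)  # pattern for the chapter body
    text = content.group()
    text = re.sub(..., ..., text)  # first clean-up pass
    result = re.sub(..., ..., text)  # second clean-up pass
    with open(... + name, ...) as f:  # path prefix and mode
        f.write(result)  # the with-block closes the file


def mulu():
    """Crawl the chapter URLs listed on the catalogue page."""
    url = ...  # catalogue page URL
    response = requests.get(url)
    html = response.text
    html = re.findall(..., html)  # pattern for the catalogue entries
    i = 0  # assumed start value
    for line in html:
        if i < ...:  # number of leading entries to skip
            i = i + 1  # assumed increment
        else:
            zhengwen(line[0], line[1])  # assumed indices: (url, name)


mulu()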
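
As a rough illustration of the catalogue step, the sketch below runs `re.findall` with two capture groups over an inline HTML snippet, so it needs no network access. The sample HTML, the pattern, the `skip` parameter and the helper name `parse_catalogue` are all assumptions made for illustration; the original post's patterns and URLs did not survive.

```python
import re

# Hypothetical catalogue markup; the real page structure is not known.
SAMPLE_HTML = """
<dd><a href="/book/1.html">Chapter 1</a></dd>
<dd><a href="/book/2.html">Chapter 2</a></dd>
<dd><a href="/book/3.html">Chapter 3</a></dd>
"""

# Assumed pattern: group 1 is the relative URL, group 2 the chapter title.
LINK_PATTERN = re.compile(r'<a href="(.*?)">(.*?)</a>')


def parse_catalogue(html, skip=0):
    """Return (url, name) tuples, ignoring the first `skip` matches."""
    links = LINK_PATTERN.findall(html)
    return links[skip:]


chapters = parse_catalogue(SAMPLE_HTML, skip=1)
for url, name in chapters:
    print(url, name)
```

With two capture groups, `re.findall` returns one tuple per match, which is why each entry can be unpacked into a URL and a name. Slicing the list is one way to drop leading entries without a manual counter.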
Web Scraping Study Notes, Part 1
Reposted from blog.51cto.com/13155409/2117709