python-python爬取豆果网（菜谱信息） - 代码天地

python-python爬取豆果网（菜谱信息）

企业开发 2019-01-22 21:21:09 阅读次数: 0

 1 #-*- coding = utf-8 -*-
 2 #获取豆果网图片
 3 import io
 4 from bs4 import BeautifulSoup
 5 import requests
 6 
 7 url = "https://www.douguo.com/cookbook/2029254.html"
 8 
 9 header = {'User-Agent':'Mozilla/5.0 (Windows NT 6.1; WOW64; rv:23.0) Gecko/20100101 Firefox/23.0'}
10 html = requests.get(url,headers = header)
11 text = BeautifulSoup(html.content,"lxml")
12 img_title = text.select("#banner img")
13 imgg = img_title[0].get("src")
14 
15 
16 def get_img_data(ul):
17     htm = requests.get(ul,headers = header)
18     f =  open("1.jpg","wb")
19     f.write(htm.content)
20     f.close()
21 menu_img   = get_img_data(imgg)
22 menu_title_0 = text.select('.title.text-lips')[0].text
23 menu_intro   = text.select('.intro')[0].text
24 menu_title_1 = text.select('.mini-title')[0].text
25 menu_content_scname = text.find_all('span',class_='scname')
26 menu_content_scnum = text.find_all('span',class_='scnum')
27 menu_title_2 = text.select('.mini-title')[1].text
28 menu_step = text.select('.stepinfo')
29 
30 print(menu_title_0)
31 print(menu_intro)
32 print(menu_title_1)
33 count = 0
34 for i in menu_content_scname:
35     print(i.text," ",menu_content_scnum[count].text)
36     count = count + 1
37 print(menu_title_2)
38 for menu_step_i in menu_step:
39     print(menu_step_i.text)

View Code

猜你喜欢

转载自www.cnblogs.com/0526yao/p/10306119.html

python-python爬取豆果网（菜谱信息）

python-python爬取妹子图片

用python爬取热门菜谱清单

爬虫：爬取豆果网和美食网的菜单

对拉勾网职位信息的爬取（python）

python爬取知网论文信息

Python 爬取拉勾网python职位信息

#python学习笔记#使用python爬取拉勾网职位信息（二）：爬取数据

Python 爬虫爬取安智网应用信息

Python 3.6 优雅的爬取猎聘网招聘信息

python scrapy爬取当当网商品信息

用Python爬取拉钩网招聘职位信息

python爬虫— 拉勾网职位信息爬取

python爬虫爬取淘宝网商品信息

简单python爬虫爬取拉勾网职位信息

通过Python爬取拉勾网的职位信息

Python使用request爬取拉钩网信息

Python3爬取猎聘网招聘信息

利用python爬取贝壳网租房信息

Python 爬取赶集网租房信息

python爬取慕课网课程信息

python爬虫之爬取《贵州农经网》信息

python requests爬取拉勾网职位信息

python爬取科学网基金项目信息

selenium爬取拉勾网python职位信息

python爬虫-selenium爬取链家网房源信息

python爬虫练习爬取美团网酒店信息

python爬取知网

python爬取拉钩网

python爬取网图

今日推荐

LFOSSA 源来如此公开课 | 掌握云原生未来：CNCF 认证全面攻略与备考秘籍

国产云输入法——仅华为无云端数据上传安全问题

开源日报 | 工业开源项目OGG 1.0；姐姐，你要和我一起配置火狐吗；苹果AI遥遥落后？Fedora 40

开放签电子签章：停止新增，优化体验，前进更进（五一假期前工作）

开源日报 | 中学生开源前端动画引擎；全球首个Llama3 8B中文版开源模型；联想电脑恐出局；Linus讽刺AI炒作

“百模大战”必有一战 | 2024中国“百模大战”竞争格局分析

周排行

Family Tree 题解

BZOJ 1093 最大半连通子图 SCC + DP

幂等处理

Spring----学习（2）----XML 配置Bean 自动装配

SQL Server 远程更新目标表数据

HIbernate3.6 环境搭建

特殊符号正则表达式

【Linux】第一章进程的理解

843. n-皇后问题（dfs+输出各种情况）

空间数据库2

每日归档

更多

2024-04-26(39)

2024-04-25(22)

2024-04-24(36)

2024-04-23(26)

2024-04-22(39)

2024-04-21(0)

2024-04-20(6)

2024-04-19(5)

2024-04-18(0)

2024-04-17(5)