python爬虫实践——爬取百度首页 - 代码天地

python爬虫实践——爬取百度首页

编程语言 2019-01-09 09:40:47 阅读次数: 0

版权声明：本文为博主原创文章，未经博主允许不得转载。 https://blog.csdn.net/muzhiqian/article/details/86131765

写一个最简单的例子，爬取百度首页右上角的“新闻”链接的名称和其URL。

截取新闻的xpath,(F12,选择新闻两字，右击，选择Copy-Copy Xpath).(注意：若登录百度，相应xpath会改变，此为非登录状态)

import requests
from lxml import etree

response = requests.get('https://www.baidu.com')

response.encoding='utf-8'

selector = etree.HTML(response.text)
news_text = selector.xpath('//*[@id="u1"]/a[1]/text()')[0]
news_url=selector.xpath('//*[@id="u1"]/a[1]/@href')[0]
print(news_text)
print(news_url)

输出：

新闻
http://news.baidu.com

猜你喜欢

转载自blog.csdn.net/muzhiqian/article/details/86131765

python爬虫实践——爬取百度首页

python入门爬虫之爬取百度首页的热搜榜

Python爬虫案例：爬取百度图片

python爬虫，爬取百度图片

python爬虫爬取百度贴吧图片

Python爬虫百度360信息搜索并爬取

python爬虫小程序,爬取百度图片

python --爬虫--爬取百度翻译

python爬虫爬取百度图片

python爬虫爬取百度贴吧帖子

python爬虫模拟登录爬取百度图片

python爬虫篇2：爬取百度图片

爬虫——百度图片爬取

PHP爬虫-爬取百度贴吧首页违规主题贴

百度图片爬虫-python版-如何爬取百度图片?

scrapy 试用爬取百度首页

爬虫_使用urllib库无任何反爬手段爬取百度首页

自学python，第一次爬取百度首页

Python爬取百度图片

Python 爬取百度音乐

python——百度文库爬取

Python 爬取百度图片

【Python】爬取百度图片

Python 百度图片爬取

python简单爬虫爬取百度百科python词条网页

简单的python爬虫（爬取百度百科词条）

python爬虫入门--爬取百度百科10000条记录

Python 爬虫实例（15）爬取百度百聘

Python爬虫爬取百度百科词条

python 爬虫——针对query爬取百度百科页面

今日推荐

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

《美国对全球网络空间安全与发展的威胁和破坏》报告发布

火速冲上 GitHub 热榜 —— 开源编程语言、框架哪有这么可爱？

北京人形机器人创新中心发布全球首个纯电驱拟人奔跑的全尺寸人形机器人“天工”

周排行

rbac——界面、权限

Apache CXF + SpringMVC 整合发布WebService

so插件化

Vue.js实战系列---图标字体制作（svg格式）

PAT乙级 1007 素数对猜想(孪生素数对) (20分) ---（C语言 + 详细注释）

被IRM保护的文档，打开失败

Calendar和Date计算日期差的小问题

win10子系统ubuntu18.4安装docker

利用Wrap Shell Script定位Android Native内存泄漏

MySQL: Transaction (Part I - Basic Concept)

每日归档

更多

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)

2024-04-24(36)