Python Web Scraping: Using BeautifulSoup and requests

Posted 2018-06-14 11:52:46

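requests is an HTTP request library for Python, roughly what Retrofit is for Android. Its features include Keep-Alive and connection pooling, cookie persistence, automatic content decompression, HTTP proxies, SSL verification, connection timeouts, sessions, and more. When this post was written it supported both Python 2 and Python 3; recent releases support Python 3 only.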
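Several of the features listed above (connection reuse, cookie persistence, timeouts) come together in `requests.Session`. The following is a minimal sketch, not part of the original post: the helper names `make_session` and `fetch_html`, the `User-Agent` string, and the 10-second default timeout are illustrative assumptions.

```python
import requests


def make_session(user_agent="example-crawler/0.1"):
    """Create a Session; it reuses connections and keeps cookies across requests.

    The User-Agent value is a hypothetical placeholder.
    """
    session = requests.Session()
    session.headers.update({"User-Agent": user_agent})
    return session


def fetch_html(session, url, timeout=10):
    """Fetch a page and return its text, raising on HTTP error status codes.

    The timeout (in seconds) is an assumed default; requests itself
    waits indefinitely when no timeout is given.
    """
    response = session.get(url, timeout=timeout)
    response.raise_for_status()
    return response.text
```

A call such as `fetch_html(make_session(), url)` could stand in for a bare `requests.get(url).text` when several pages are fetched from the same site.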

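Install the third-party libraries (urllib is part of the Python standard library and is not installed with pip; the code below also needs beautifulsoup4 and lxml):

pip install requests

pip install beautifulsoup4 lxml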

The code for a small crawler is as follows:

# -*- coding: UTF-8 -*-
# Import the required libraries
import urllib.request
from bs4 import BeautifulSoup
import requests

url = 'https://www.phb123.com/junshi/lishi/9679_2.html'
local = "E:\\py\\imgs\\"    # folder where the images are saved (must already exist)
html_doc = requests.get(url).text
soup = BeautifulSoup(html_doc, 'lxml')   # parse html_doc
contents = soup.find_all('center')
x = 1
for con in contents:
    imgs = con.find_all('img')  # get the img tags under each center tag
    for img in imgs:
        # save the images as 1.jpg, 2.jpg, ...
        urllib.request.urlretrieve(img['src'], local + '%s.jpg' % x)
        x = x + 1
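The loop above passes `img['src']` straight to `urlretrieve`, which only works when the page uses absolute image URLs and every `img` tag has a `src` attribute. A hedged sketch of a more defensive extraction step follows; the function name `extract_image_urls`, the sample HTML, and the `example.com` addresses are made up for illustration, and the built-in `html.parser` is used here instead of `lxml` so the snippet runs without extra parser packages.

```python
from urllib.parse import urljoin

from bs4 import BeautifulSoup


def extract_image_urls(html_doc, base_url, parser="html.parser"):
    """Return absolute URLs of all <img> tags found inside <center> tags."""
    soup = BeautifulSoup(html_doc, parser)
    urls = []
    for center in soup.find_all("center"):
        for img in center.find_all("img"):
            src = img.get("src")  # None when the tag has no src attribute
            if src:
                # Resolve relative paths against the page URL
                urls.append(urljoin(base_url, src))
    return urls


# Hypothetical sample page, used only to show the behaviour offline
sample_html = """
<html><body>
<center><img src="/uploads/a.jpg"><img alt="no source"></center>
<center><img src="https://img.example.com/b.jpg"></center>
<p><img src="/uploads/ignored.jpg"></p>
</body></html>
"""

print(extract_image_urls(sample_html, "https://www.example.com/page/1.html"))
# → ['https://www.example.com/uploads/a.jpg', 'https://img.example.com/b.jpg']
```

The returned list can then be fed to the download loop in place of the raw `src` values.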


Reposted from www.cnblogs.com/ling-yu/p/9182277.html
