Fixing garbled text when scraping web pages with Python requests and BeautifulSoup

Category: Other. Published 2018-08-09 15:16:14.

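Both requests and BeautifulSoup try to detect a page's encoding on their own, and either detection can be wrong, which produces garbled text. When that happens, set the response's encoding to the page's actual encoding after requests has fetched the page and before BeautifulSoup parses it, as in the following example.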

The page source declares its encoding as utf-8.
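Reading the declaration by hand works for a single page. As a rough sketch of doing the same thing in code, the helper below looks for a `charset` value in a `<meta>` tag near the top of the raw bytes. The function name `declared_charset`, the `scan_limit` parameter, and the sample markup are hypothetical and not from the original post, and the regular expression is a simplification rather than a full HTML parser.

```python
import re

# Matches both <meta charset="utf-8"> and
# <meta http-equiv="Content-Type" content="text/html; charset=utf-8">.
_META_CHARSET = re.compile(
    rb'<meta[^>]+charset\s*=\s*["\']?\s*([\w-]+)', re.IGNORECASE
)


def declared_charset(raw_html, scan_limit=4096):
    """Return the charset named in a <meta> tag near the top of raw_html, or None.

    Hypothetical helper: raw_html is the undecoded page body (bytes).
    """
    match = _META_CHARSET.search(raw_html[:scan_limit])
    if match is None:
        return None
    return match.group(1).decode("ascii").lower()


# Made-up sample markup, used only to exercise the helper.
sample = (
    b'<html><head><meta http-equiv="Content-Type" '
    b'content="text/html; charset=GBK"><title>t</title></head></html>'
)
print(declared_charset(sample))
```

With requests, the result could be applied as `response.encoding = declared_charset(response.content) or response.apparent_encoding` before reading `response.text`, so the hard-coded value is only needed when neither source gives an answer.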

# encoding=utf-8
import requests
from bs4 import BeautifulSoup

response = requests.get("http://www.baidu.com/")
# Without this line the title is garbled; with it the title displays correctly.
# utf-8 is the encoding declared in the page's own source.
response.encoding = 'utf-8'
soup = BeautifulSoup(response.text, 'lxml')
print(soup.title.text)

Without the encoding line, the printed title is garbled. With it, the title displays correctly.
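The garbling itself can be reproduced without any network access, which may help when checking whether a wrong encoding is the cause. The sketch below encodes a made-up sample string (not taken from the original post) as UTF-8 and then decodes it with ISO-8859-1, which stands in for a wrongly detected encoding. requests falls back to ISO-8859-1 when the Content-Type header is a text type with no charset.

```python
# Hypothetical sample text; stands in for a page title served as UTF-8.
title = "你好，世界"
raw_bytes = title.encode("utf-8")

# Decoding with the wrong charset yields mojibake instead of raising an error,
# because every byte value is a valid ISO-8859-1 character.
garbled = raw_bytes.decode("iso-8859-1")

# Decoding with the page's real charset recovers the original text.
correct = raw_bytes.decode("utf-8")

# Already-garbled text can be repaired by reversing the wrong decode.
repaired = garbled.encode("iso-8859-1").decode("utf-8")

print(repr(garbled))
print(correct)
print(repaired)
```

Setting `response.encoding` before reading `response.text` avoids the wrong decode in the first place, so the repair step is only a fallback for text that has already been decoded incorrectly.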

Reposted from blog.csdn.net/leosblog/article/details/79832614
