python爬虫使用requests和BeautifulSoup出现中文乱码 - 代码天地

python爬虫使用requests和BeautifulSoup出现中文乱码

编程语言 2018-10-09 16:47:25 阅读次数: 0

版权声明：本文为博主原创文章，未经博主允许不得转载。 https://blog.csdn.net/Song_Lynn/article/details/82959708

python爬虫使用requests和BeautifulSoup出现中文乱码

requests和BeautifulSoup都是自行检测网页编码并进行编码的，所以可能会出现检测错误，需要手动更改编码方式，使得中文能够正常显示

from bs4 import BeautifulSoup
import requests

headers = {
	'user_agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/67.0.3396.62 Safari/537.36'
}
res = requests.get('http://info.2016.163.com/athlete/1280.html', headers=self.headers)
res.encoding = 'utf-8'		# 去掉这句会造成中文显示乱码，其中utf-8是根据网页源代码的编码格式指定的，也有可能是如gk18030等
soup = BeautifulSoup(res.text, 'lxml')
body_soup = soup.html.body
info = body_soup.select('.brief .table')[0]
print(info.h1.contents[0])		# 安-弗雷泽

猜你喜欢

转载自blog.csdn.net/Song_Lynn/article/details/82959708

python爬虫使用requests和BeautifulSoup出现中文乱码

Python爬虫之BeautifulSoup和requests的使用

python使用requests和BeautifulSoup爬取网页乱码问题

爬虫【三】 requests和BeautifulSoup的使用

爬虫库requests和BeautifulSoup的基本使用

爬虫 - requests 和 BeautifulSoup

python爬虫基础（requests、BeautifulSoup）

爬虫入门——requests和Beautifulsoup

【Python网络爬虫】使用requests和beautifulsoup4库轻松实现

Python爬虫实战：使用Requests和BeautifulSoup爬取网页内容

Python使用requests及BeautifulSoup构建爬虫实例代码

python3 爬虫相关-requests和BeautifulSoup

Python学习笔记11：爬虫（requests和BeautifulSoup）

python爬虫基础入门——利用requests和BeautifulSoup

python爬虫第一弹之图片- BeautifulSoup与requests的完美结合（用requests和BeautifulSoup进行爬虫）

python学习之 requests爬虫导致的中文乱码

python安装requests和BeautifulSoup

python爬虫之requests+selenium+BeautifulSoup

python3 爬虫（requests+BeautifulSoup）

Python网络爬虫笔记（四）——requests与BeautifulSoup

python爬虫基础Ⅰ——requests、BeautifulSoup：书本信息

python爬虫爬取招聘（ requests，BeautifulSoup）

使用requests+BeautifulSoup的简单爬虫练习

python---requests和beautifulsoup4模块的使用

【爬虫】使用BeautifulSoup、requests和you_get爬虫下载B站视频

python解决Requests中文乱码

python爬虫入门（二）——requests库的常见方法和使用（requests库中文官网）

Python爬虫中文乱码

【爬虫学习一】 Python实现简单爬虫（requests，BeautifulSoup）

一个超实用的python爬虫功能使用 requests BeautifulSoup

今日推荐

富文本编辑器 Quill 2.0 重磅发布，特性、可靠性与开发者体验大幅提升

“开源信徒”周鸿祎开源360智脑大模型

周排行

Ubuntu 14.04 下Fuel6.0安装部署

香港一小巴侧翻致1死16伤警方：未见机件故障

pikachu--XSS盲打

阅读深入理解JVM虚拟机笔记一

java.sql.SQLException: ORA-00932: 数据类型不一致: 应为 -, 但却获得 CLOB

oracle delete all object under an user

[LeetCode]20 Valid Parentheses 有效的括号

树形DP求树的直径【模板】

Context propagation over HTTP in Go

【PAT】（B）1053 住房空置率 (20)*

每日归档

更多

2024-04-18(0)

2024-04-17(5)

2024-04-16(70)

2024-04-15(42)

2024-04-14(0)

2024-04-13(119)

2024-04-12(38)

2024-04-11(14)

2024-04-10(68)

2024-04-09(5)