Scraping the Weibo Top-Ten Hot Search List with requests and re


Programming Languages, 2023-07-29 17:45:27

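import re

import chardet
import requests

# Send a browser-like User-Agent; some sites reject the default one used by requests.
headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/112.0.0.0 Safari/537.36 Edg/112.0.1722.39'
}

# Fetch the tophub.today page used here as the source of the hot search list.
# The timeout keeps the script from hanging; raise_for_status() stops on HTTP errors.
response = requests.get('https://tophub.today/n/KqndgxeLl9', headers=headers, timeout=10)
response.raise_for_status()

# Detect the encoding from the raw bytes; chardet may return None, so fall back to UTF-8.
encoding = chardet.detect(response.content)['encoding'] or 'utf-8'

# Decode the page, replacing any bytes that do not fit the detected encoding.
html_content = response.content.decode(encoding, errors='replace')

# Titles are the link text inside <td class="..."> cells;
# heat values are the text of plain <td> cells that start with a digit.
top_ten_regex = r'<td class=".*?"><a href=".*?">(.*?)</a>'
top_ten_heats = r'<td>(\d.*?)</td>'

top_ten_matches = re.findall(top_ten_regex, html_content, re.DOTALL)
top_ten_heat = re.findall(top_ten_heats, html_content, re.DOTALL)

print("Top Ten List:")

# zip() stops at the shorter list, so fewer than ten matches cannot raise IndexError.
for i, (title, heat) in enumerate(zip(top_ten_matches[:10], top_ten_heat[:10]), start=1):
    print("{}.{}:{}".format(i, title, heat))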
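
The two regular expressions can be tried out offline before pointing them at the live page. The following is a minimal sketch: the helper name `extract_top_entries` and the `SAMPLE_HTML` markup are hypothetical and only imitate the kind of table rows the patterns expect, so the real page layout may differ.

```python
import re

# Same patterns as in the script above.
TITLE_REGEX = r'<td class=".*?"><a href=".*?">(.*?)</a>'
HEAT_REGEX = r'<td>(\d.*?)</td>'

# Hypothetical markup for illustration only; not copied from the real page.
SAMPLE_HTML = '''
<table>
<tr><td align="center">1.</td><td class="al"><a href="/l?e=1">Topic A</a></td><td>1234567</td></tr>
<tr><td align="center">2.</td><td class="al"><a href="/l?e=2">Topic B</a></td><td>987654</td></tr>
</table>
'''


def extract_top_entries(html, limit=10):
    """Return up to `limit` (rank, title, heat) tuples found in `html`."""
    titles = re.findall(TITLE_REGEX, html, re.DOTALL)
    heats = re.findall(HEAT_REGEX, html, re.DOTALL)
    # zip() pairs titles with heats by position and stops at the shorter list.
    pairs = list(zip(titles, heats))[:limit]
    return [(rank, title, heat) for rank, (title, heat) in enumerate(pairs, start=1)]


if __name__ == "__main__":
    for rank, title, heat in extract_top_entries(SAMPLE_HTML):
        print("{}.{}:{}".format(rank, title, heat))
```

Pairing by position assumes that every title row also has a heat cell. If the page contains rows without a heat value, the two lists drift out of step, and an HTML parser that reads each row as a unit would be a safer choice than two independent regular expressions.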


Reposted from blog.csdn.net/weixin_51395932/article/details/130179006
