Python爬虫刷博客访问量 - 代码天地

Python爬虫刷博客访问量

其他 2019-01-26 09:51:06 阅读次数: 0

import re
import requests
from requests import RequestException
import time
import random
def get_page(url):
	try:
		headers = {
			'Referer': 'https://blog.csdn.net',  # 伪装成从CSDN博客搜索到的文章
			'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/68.0.3440.75 Safari/537.36'  # 伪装成浏览器
		}
		response = requests.get(url, headers=headers)
		if response.status_code == 200:
			return response.text
		return None
	except RequestException:
		print('请求出错')
		return None
def parse_page(html):
	try:
		read_num = int(re.compile('<span.*?read-count.*?(\d+).*?</span>').search(html).group(1))
		return read_num
	except Exception:
		print('解析出错')
		return None
def main():
	try:
		while 1:
			url = 'https://blog.csdn.net/swustzhaoxingda/article/details/84324164'  # 待刷浏览量博客的url
			html = get_page(url)
			if html:
				read_num = parse_page(html)
				if read_num:
					print('当前阅读量：', read_num)
			url = 'https://blog.csdn.net/swustzhaoxingda/article/details/86614225'  # 待刷浏览量博客的url
			html = get_page(url)
			if html:
				read_num = parse_page(html)
				if read_num:
					print('当前阅读量：', read_num)
			url = 'https://blog.csdn.net/swustzhaoxingda/article/details/86591922'  # 待刷浏览量博客的url
			html = get_page(url)
			if html:
				read_num = parse_page(html)
				if read_num:
					print('当前阅读量：', read_num)
			url = 'https://blog.csdn.net/swustzhaoxingda/article/details/86617054'  # 待刷浏览量博客的url
			html = get_page(url)
			if html:
				read_num = parse_page(html)
				if read_num:
					print('当前阅读量：', read_num)
			sleep_time = random.randint(60, 83)
			print('please wait', sleep_time, 's')
			time.sleep(sleep_time)  # 设置访问频率，过于频繁的访问会触发反爬虫
	except Exception:
		print('出错啦！')
if __name__ == '__main__':
	main()

猜你喜欢

转载自blog.csdn.net/swustzhaoxingda/article/details/86617203

Python爬虫刷博客访问量

python爬虫设计刷博客访问量（刷访问量，赞，爬取图片）

python爬虫刷访问量 2019

python 爬虫刷访问量

Python爬虫1：博客访问量

Python3刷csdn博客访问量

python使用urllib刷博客访问量技术实现

Python 刷访问量

python爬虫实战：刷某博客站点的访问量（转）

python2.7爬虫脚本实现刷取CSDN博客访问量。

用爬虫来对csdn个人博客进行访问，刷访问量

python 爬虫爬去自己博客的访问量

python requests、xpath爬虫增加博客访问量

(最新)使用爬虫刷CSDN博客访问量——亲测有效

Python代码刷访问量

利用python刷CSDN访问量

增加博客访问量（Python）

Python-批量刷博客园访问量脚本

Python--Selenium爬虫刷CSND访问量！我说怎么访问量这么高呢！

【Python脚本】-爬虫得到CSDN博客的文章访问量和评论量

Python3.7实现自动刷博客访问量（只需要输入用户id）（转)

python刷CSDN访问量的简单方法！

Python实战之网页刷访问量方法

Python自动刷取csdn文章访问量

Python实战：使用selenium刷访问量

使用python爬取csdn博客访问量

Python网络数据采集（1）：博客访问量统计

Python统计博客园访问量

利用Python爬虫刷店铺微博等访问量最简单有效教程

爬虫小练（刷访问量）（python+requests（headers+proxy)+Queue+threading）

今日推荐

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

GCC 14.1 发布

面壁智能发布 Eurux-8x22B 开源大模型 —— 堪称「理科状元」

开源日报 | 谷歌扶持鸿蒙上位；开源Rabbit R1；Docker加持的安卓手机；微软的焦虑和野心；海尔电器把开放平台关了

中国码农的“35岁魔咒”

蘭雅 CorelDRAW 插件 2024.5.1 国际劳动节版，免费下载

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

周排行

Java自定义时间格式

同步整形电路

在开发中最最最常用的字符串的属性大集合

Linux 查看端口占用并杀掉

Java基础四：ArrayList

多线程之死锁就是这么简单

mysql 基础命令集

awk 命令详解

Centos6.3编译安装nginx+php步骤

OCR （Optical Character Recognition，光学字符识别）

每日归档

更多

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)