Python crawler: a small Baidu Tieba sign-in tool - 代码天地

Other 2018-09-03 01:18:28

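import re
import time

import requests

header = {
    "Cookie": "paste the cookie of your logged-in session here (required)",
    "User-Agent": "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/68.0.3440.106 Safari/537.36",
}
# Open the "My Tieba" home page of the logged-in account
url = "Baidu home page -> Tieba (top right) -> your username (My Tieba, top right); paste that page's URL here"
html = requests.get(url, headers=header)
# print(html.text)

# Extract each forum's ID, name, and related info
s1 = r'"forum_id":(.*?),"forum_name":"(.*?)"'
tieba_info = re.compile(s1, re.S).findall(html.text)
# print(tieba_info)

for forum_id, raw_name in tieba_info:
    time.sleep(3)  # Pace the requests; going too fast tends to trigger a captcha, which makes the sign-in fail
    # Debug output: the name as captured, still \uXXXX-escaped, and its bytes
    print(raw_name)
    print(raw_name.encode("latin-1"))
    # Names of all forums that can be signed in to
    # print(raw_name.encode("latin-1").decode("unicode_escape"))

    # Get tbs: the sign-in request needs a value named tbs, which is embedded in the forum page
    tieba_name = raw_name.encode("latin-1").decode("unicode_escape")
    tieba_link = "https://tieba.baidu.com/f"
    s2 = r"tbs': \"(.*?)\""  # The pattern mixes single and double quotes; mind the escaping
    qiandao_url = "https://tieba.baidu.com/sign/add"

    # Perform the sign-in; inspect the response body to see whether it succeeded
    try:
        # params= lets requests URL-encode the (often non-ASCII) forum name
        info = requests.get(tieba_link, params={"kw": tieba_name}, headers=header)
        # print(info.text)
        tbs_matches = re.compile(s2, re.S).findall(info.text)
        if not tbs_matches:
            # No tbs in the page (for example, a captcha page came back): skip this forum
            print(tieba_name, "tbs not found, skipped")
            continue
        tieba_tbs = tbs_matches[0]
        # print(tieba_tbs)

        # POST data for the sign-in request
        qiandao_data = {"ie": "utf-8",
                        "kw": tieba_name,
                        "tbs": tieba_tbs}  # The meaning of tbs is unclear; searching the surrounding page source may reveal what it is tied to
        qiandao = requests.post(qiandao_url, data=qiandao_data, headers=header)
        # print(qiandao.text)
        print(tieba_name, "sign-in request sent")
    except requests.RequestException:
        print(tieba_name, "error")
        continue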
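
The forum names captured by the first regular expression are still in the page's `\uXXXX`-escaped form, which is why the script round-trips them through `latin-1` and `unicode_escape`. Here is a minimal sketch of that step on its own, run against a made-up page fragment rather than a real response. The forum IDs and names in `SAMPLE_PAGE` are illustrative, and `decode_forum_name` is a hypothetical helper, not part of the original script.

```python
import re

# Made-up fragment in the shape the script's first regex expects;
# the IDs and names are illustrative only.
SAMPLE_PAGE = (
    r'{"forum_id":1,"forum_name":"python"},'
    r'{"forum_id":2,"forum_name":"\u674e\u6bc5"}'
)

# Same pattern as s1 in the script
FORUM_PATTERN = re.compile(r'"forum_id":(.*?),"forum_name":"(.*?)"', re.S)


def decode_forum_name(raw_name):
    """Hypothetical helper: turn an escaped name into readable text."""
    # The captured text is plain ASCII containing backslash-u sequences,
    # so it survives the latin-1 encode and is expanded by unicode_escape.
    return raw_name.encode("latin-1").decode("unicode_escape")


forums = [
    (forum_id, decode_forum_name(raw_name))
    for forum_id, raw_name in FORUM_PATTERN.findall(SAMPLE_PAGE)
]
for forum_id, forum_name in forums:
    print(forum_id, forum_name)
```

Note that the `latin-1` encode raises `UnicodeEncodeError` if a captured name already contains characters outside Latin-1, so this round trip only suits pages that escape every non-ASCII character.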

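The loop reports a sign-in for every POST that does not raise, and leaves it to the reader to check the response body. The sketch below shows one way to do that check. It assumes the endpoint answers with a JSON object whose numeric `no` field is 0 on success and whose `error` field carries a message otherwise. The post does not confirm that response shape, so compare it with the output of `print(qiandao.text)` first. `sign_in_succeeded` is a hypothetical helper.

```python
import json


def sign_in_succeeded(response_text):
    """Hypothetical helper: interpret the body returned by the sign-in POST.

    Assumes a JSON object with a numeric "no" field (0 = success) and an
    optional "error" message. This shape is an assumption, not something
    the original post documents.
    """
    try:
        payload = json.loads(response_text)
    except ValueError:
        # Not JSON at all, e.g. an HTML captcha or login page
        return False, "response was not JSON"
    if not isinstance(payload, dict):
        return False, "unexpected response shape"
    if payload.get("no") == 0:
        return True, ""
    return False, str(payload.get("error", "unknown error"))


# Illustrative response bodies, not captured from the real service
print(sign_in_succeeded('{"no": 0, "error": ""}'))
print(sign_in_succeeded('{"no": 1, "error": "example failure"}'))
print(sign_in_succeeded("<html>captcha</html>"))
```

Inside the loop this could be called as `ok, reason = sign_in_succeeded(qiandao.text)` right after the POST, printing `reason` when `ok` is false.
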
Reposted from www.cnblogs.com/cwkcwk/p/9576518.html
