Python web scraping basics (requests, BeautifulSoup)
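
As a minimal sketch of the two libraries named in the title (the URL https://example.com is only a placeholder): requests downloads the page, and BeautifulSoup parses the HTML so the links can be pulled out.

import requests
from bs4 import BeautifulSoup

resp = requests.get('https://example.com')        # fetch the page over HTTP
soup = BeautifulSoup(resp.text, 'html.parser')    # parse the returned HTML
for a in soup.find_all('a', href=True):           # every anchor tag with an href
    print(a['href'])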

Batch upvoting posts on Chouti (dig.chouti.com)

import re
import requests
from bs4 import BeautifulSoup
from urllib.request import urlopen

# pages = set()
# def getlinks(pageurl):
#     html = urlopen("http://en.wikipedia.org" + pageurl)
#     bsobj = BeautifulSoup(html, 'html.parser')
#     links = bsobj.find_all('a', href=re.compile(r'^(/wiki/)'))
#     for link in links:
#         print(link['href'])
#
# getlinks('')

headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/53.0.2785.104 Safari/537.36 Core/1.53.4882.400 QQBrowser/9.7.13059.400',
}

# Step 1: visit the front page anonymously; the cookies returned here are the ones
# the site authorizes at login, so they must accompany every later request.
response_1 = requests.get(url='https://dig.chouti.com/', headers=headers)
cookie_dict = response_1.cookies.get_dict()

# Step 2: log in with phone number and password, carrying the cookies from step 1.
response_2 = requests.post(
    url='https://dig.chouti.com/login',
    data={
        'phone': '8615733239039',
        'password': 'woshinidie',
        'oneMonth': '1',
    },
    headers=headers,
    cookies=cookie_dict,
)

# Step 3: walk the hot-news pages, read each item's id from the <img> tag's lang
# attribute, and POST a vote for every id found.
for page in range(1, 3):
    html = requests.get(url='https://dig.chouti.com/all/hot/recent/{}'.format(page), headers=headers)
    soup = BeautifulSoup(html.text, 'html.parser')
    content_list = soup.find(name='div', id='content-list')
    items = content_list.find_all(attrs={'class': 'item'})
    for i in items:
        img = i.find('img')
        click_id = img.get('lang') if img else None
        if click_id:
            print(click_id)
            click_hand = requests.post(
                url='https://dig.chouti.com/link/vote?linksId={}'.format(click_id),
                headers=headers,
                cookies=cookie_dict,
            )
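
The same flow can also be written with requests.Session, which stores the cookies from the first GET and from the login and sends them back automatically, so they no longer have to be passed by hand. A minimal sketch, assuming the endpoints above still behave the same:

import requests
from bs4 import BeautifulSoup

session = requests.Session()
# the session keeps cookies from every response and attaches them to later requests
session.headers.update({'User-Agent': 'Mozilla/5.0'})

session.get('https://dig.chouti.com/')                 # anonymous visit: picks up the initial cookies
session.post('https://dig.chouti.com/login',           # log in; the session retains the cookies
             data={'phone': '8615733239039', 'password': 'woshinidie', 'oneMonth': '1'})

for page in range(1, 3):
    resp = session.get('https://dig.chouti.com/all/hot/recent/{}'.format(page))
    soup = BeautifulSoup(resp.text, 'html.parser')
    for item in soup.find(id='content-list').find_all(attrs={'class': 'item'}):
        img = item.find('img')
        if img and img.get('lang'):
            session.post('https://dig.chouti.com/link/vote?linksId={}'.format(img['lang']))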

Reprinted from www.cnblogs.com/wangyajian/p/9156242.html