py爬虫 —— 三个爬虫的小栗子

其他 2019-11-17 11:34:11 阅读次数: 0

三个爬虫的小栗子

第一个例子 —— 京东商品的爬取案例

import requests


def getHTMLtext(url):
    try:
        r = requests.request('get' ,url )
        r.raise_for_status()
        r.encoding = r.apparent_encoding
        return r.text
    except:
        return "出现异常"
url = "https://item.jd.com/100005477055.html"
print(getHTMLtext(url))

结果如下：

第二个例子 —— 亚马逊爬虫案例

import requests

url = 'https://www.amazon.cn/dp/B00RY59GJ0?ref_=Oct_DLandingSV2_PC_e9324a46_0&smid=A2EDK7H33M5FFG'

r = requests.request('get',url)

print(r.status_code)

header = {'user-agent': 'Mozilla/5.0'}

r = requests.request('get',url,headers = header)

print(r.headers)

print(r.status_code)

print(r.text)

亚马逊的官网是反爬虫的，因此我们要将user-agent更换

第三个例子 —— 百度的post案例

猜你喜欢

转载自www.cnblogs.com/Nlifea/p/11875649.html

py爬虫 —— 三个爬虫的小栗子

三个爬虫的小栗子

py爬虫 —— py爬虫requests

爬虫——三个小实战

(PY爬虫03)爬虫初识

Py爬虫项目

【PY爬虫】Request库

py爬虫姿势

Py爬虫学习_requests库

Py爬虫学习_urllib库

Scrapy爬虫-pipeline.py

爬虫-----lagou2.py

爬虫-------lagou1.py

py爬虫task1

爬虫的三个常用库

基础爬虫------三个简单爬虫案例(很funny)

爬虫---基础语法及案例 py-2

爬虫----基础语法及案例 Py-3

Py3异步爬虫浅涉

python爬虫(十八)-------------------scrapy piplines.py

py-02-爬虫比价器

[python爬虫]常用user_agent.py

(PY爬虫02) 制定爬虫的学习计划了

python爬虫scrapy比较常用的三个命令

Python-网络爬虫三个流程的实现

爬虫实践：陕西招投标爬虫（三个网站）xpath封装并exe

Scrapy爬虫之settings.py配置文件详解

Tieba_Spider(爬虫)（py2.xx）

小白的py爬虫学习笔记_1_2

0724py:urllib.request模块爬虫初步了解

今日推荐

中国码农的“35岁魔咒”

蘭雅 CorelDRAW 插件 2024.5.1 国际劳动节版，免费下载

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

《美国对全球网络空间安全与发展的威胁和破坏》报告发布

周排行

[编程题]学英语

[codeforces 1288A] Deadline 约数+模

Python的web开发

Docker在Centos 7上的部署

python编码

解决Ubuntu16.04 fatal error: json/json.h: No such file or directory

mysql并发插入

rest接口如何适应jsonp的方案

linux 终端上网设置

高数——等号两边同时求导、积分的解释

每日归档

更多

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)