Sharing a Python 3 crawler that scrapes images from Baidu

Category: Other | 2020-03-04 10:38:42

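Without further ado, here is the code. (Despite the title, the URL it requests belongs to 360 image search, image.so.com, not Baidu.)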

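import os
import re
import urllib.error
import urllib.parse
import urllib.request

key = "风景图片"  # search keyword ("landscape pictures")
keyname = urllib.parse.quote(key)  # percent-encode the keyword for the query string
url = "http://image.so.com/i?src=360pic_strong&z=1&i=0&cmg=1760c80a08bb4de0be404c0d98032520&q=" + keyname

# Send a browser-like User-Agent with every request made through urllib.request
headers = ("User-Agent", "Mozilla/5.0 (Windows NT 10.0; WOW64; rv:35.0) Gecko/20100101 Firefox/35.0")
opener = urllib.request.build_opener()
opener.addheaders = [headers]
urllib.request.install_opener(opener)

# Thumbnail URLs appear in the page source as "_thumb_bak":"..."
pat = '"_thumb_bak":"(.*?)"'
data = urllib.request.urlopen(url).read().decode('utf-8', 'ignore')
allurl = re.compile(pat).findall(data)

save_dir = "D:/C/爬虫学习网络体系/爬虫爬去内容/风景图片/"
os.makedirs(save_dir, exist_ok=True)  # make sure the target folder exists

for i, thisurl in enumerate(allurl):
    try:
        # The URLs are escaped ("\/" instead of "/"), so unescape them first
        this = thisurl.replace('\\/', '/')
        file = save_dir + str(i) + ".jpg"
        urllib.request.urlretrieve(this, filename=file)
    except urllib.error.URLError as e:
        # An HTTPError carries a status code; a plain URLError only has a reason
        if hasattr(e, "code"):
            print(e.code)
        if hasattr(e, "reason"):
            print(e.reason)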

One part of this kept failing, and it took many rounds of debugging before it worked.

The culprit was thisurl. Printing it shows a URL in this format:

http:\/\/p0.so.qhimgs1.com\/sdr\/_240_\/t01610a89bbf5ff392c.jpg

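At first I was completely baffled.
Then it occurred to me that simply replacing each "\/" with "/" would fix it. That was really the only pitfall in writing this crawler.
Everything else went smoothly.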
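The replacement can be checked in isolation on the sample URL printed above. The following is only a minimal sketch: `unescape_slashes` is a hypothetical helper name that does not appear in the original script. Because "\/" is also a valid JSON escape for "/", the sketch additionally shows, as an alternative, letting Python's `json` module decode the string.

```python
import json


def unescape_slashes(escaped_url):
    """Replace escaped slashes (backslash + slash) with plain slashes."""
    return escaped_url.replace("\\/", "/")


# Raw string so the backslashes are kept exactly as they appear in the page source
raw = r"http:\/\/p0.so.qhimgs1.com\/sdr\/_240_\/t01610a89bbf5ff392c.jpg"

clean = unescape_slashes(raw)
print(clean)

# Alternative: wrap the text in quotes and let the JSON parser decode it
also_clean = json.loads('"' + raw + '"')
print(also_clean == clean)
```

The plain `replace` is enough for this page; the `json.loads` route would also handle other JSON escapes if the matched text happened to contain any.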
You can also try routing the requests through a proxy server, although a simple, small-scale crawler like this one does not need it.
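For readers who do want a proxy, here is one possible sketch using `urllib.request.ProxyHandler`. The helper name `build_proxy_opener` and the address `http://127.0.0.1:8080` are placeholders chosen for illustration, not values from the original post; substitute a proxy you actually have access to.

```python
import urllib.request


def build_proxy_opener(proxy_address, user_agent):
    """Build an opener that sends HTTP requests through the given proxy."""
    proxy = urllib.request.ProxyHandler({"http": proxy_address})
    opener = urllib.request.build_opener(proxy)
    opener.addheaders = [("User-Agent", user_agent)]
    return opener


# Placeholder proxy address (assumption); the User-Agent is the one used in the script above
opener = build_proxy_opener(
    "http://127.0.0.1:8080",
    "Mozilla/5.0 (Windows NT 10.0; WOW64; rv:35.0) Gecko/20100101 Firefox/35.0",
)

# Uncomment to make urlopen() and urlretrieve() use the proxy globally:
# urllib.request.install_opener(opener)
```

Installing the opener replaces the one set up earlier in the script, so the User-Agent header is attached here as well.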
That is all I have to share for now. Onward with the little crawler journey!


Reposted from blog.csdn.net/qq_42992704/article/details/86756021
