python爬虫简单网页图片 - 代码天地

python爬虫简单网页图片

其他 2018-09-18 14:09:10 阅读次数: 0

#!/usr/bin/python
# coding:utf-8
# 实现一个简单的爬虫，爬取百度贴吧图片
import urllib
import re

# 根据url获取网页html内容
def getHtmlContent(url):
    page = urllib.urlopen(url)
    return page.read()

# 从html中解析出所有jpg图片的url
# 百度贴吧html中jpg图片的url格式为：<img ... src="XXX.jpg" width=...>
def getJPGs(html):
    # 解析jpg图片url的正则
    jpgReg = re.compile(r'<img.+?src="(.+?\.jpg)" width')  # 注：这里最后加一个'width'是为了提高匹配精确度
    # 解析出jpg的url列表
    jpgs = re.findall(jpgReg,html)

    return jpgs

# 用图片url下载图片并保存成制定文件名
def downloadJPG(imgUrl,fileName):
    urllib.urlretrieve(imgUrl,fileName)

# 批量下载图片，默认保存到当前目录下
def batchDownloadJPGs(imgUrls,path = './'):
    # 用于给图片命名
    count = 1
    for url in imgUrls:
        downloadJPG(url,''.join([path,'{0}.jpg'.format(count)]))
        count = count + 1

# 封装：从百度贴吧网页下载图片
def download(url):
    html = getHtmlContent(url)
    jpgs = getJPGs(html)
    batchDownloadJPGs(jpgs)

def main():
    url = 'https://www.duitang.com/blog/?id=143148191'
    download(url)

if __name__ == '__main__':
    main()

猜你喜欢

转载自blog.csdn.net/qq_35695041/article/details/81557834

python爬虫简单网页图片

Python 简单网页爬虫

爬虫-简单抓取网页图片

python3爬虫爬取网页图片简单示例

Python爬虫学习笔记一：简单网页图片抓取

Python3简单爬虫抓取网页图片

基于Python的网页图片爬虫

Python简单图片爬虫

python爬虫.1.简单的网页爬虫

python爬虫.3.下载网页图片

Python爬虫之网页图片抓取

Python学习---网页爬虫[下载图片]

Python——网络爬虫（爬取网页图片）

python爬虫-- 抓取网页、图片、文章

Python爬虫入门——爬取网页图片

python爬虫爬取网页图片

python爬虫：批量爬取网页图片

爬虫 python 正则匹配保存网页图片

Python爬虫1：简单抓取网页

Python 网页爬虫爬取网页图片demo

使用Python爬虫爬取简单网页（Python爬虫入门）

爬虫简易入门代码-爬取简单网页图片

使用python实现简单网页图片抓取

python学习----网页图片文字识别(简单)

爬虫抓取网页图片

python爬虫-简单的图片爬取实现

python(1)-实现简单的图片爬虫

python爬虫-简单使用xpath下载图片

python3实现简单图片爬虫

基于Python实现的爬虫与简单图片处理

今日推荐

中国码农的“35岁魔咒”

蘭雅 CorelDRAW 插件 2024.5.1 国际劳动节版，免费下载

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

《美国对全球网络空间安全与发展的威胁和破坏》报告发布

周排行

[编程题]学英语

[codeforces 1288A] Deadline 约数+模

Python的web开发

Docker在Centos 7上的部署

python编码

解决Ubuntu16.04 fatal error: json/json.h: No such file or directory

mysql并发插入

rest接口如何适应jsonp的方案

linux 终端上网设置

高数——等号两边同时求导、积分的解释

每日归档

更多

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)