Scraping Image URLs from a Web Page with Python 3 and Writing Them to a File

Category: Other | 2019-04-22 00:24:42

import re
import urllib.request

# Fetch the page at the given URL and return its raw bytes
def getHtml(url):
    with urllib.request.urlopen(url) as response:
        return response.read()

# Write data to a file
def writeFile(fileName, data):
    # Mode 'a' appends, so existing data is not overwritten
    with open(fileName, 'a', encoding='utf-8') as htmlFile:
        # Put each entry on its own line
        htmlFile.write(data + '\n')

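# Extract the URLs that end in .jpg
def getImgSrc(html):
    # decode() converts bytes to str
    # The non-greedy pattern stops at the first .jpg and cannot span quotes, whitespace or tags
    imgUrl = re.findall(r'https:[^\s"\'<>]+?\.jpg', html.decode('utf-8'))
    return imgUrl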

html = getHtml('https://www.imooc.com/')
print(html)

imgUrl = getImgSrc(html)
for i in imgUrl:
    print(i)
    writeFile('imgUrl.txt', i)
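
A regular expression over the raw page text only finds absolute https URLs and can miss relative or protocol-relative image paths. As a possible alternative, here is a minimal sketch that reads the src attribute of each img tag with the standard library's html.parser and resolves it against the page address with urljoin. The names ImgSrcParser and extract_jpg_urls, the example.com addresses and the sample markup are all made up for illustration and are not part of the original script.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class ImgSrcParser(HTMLParser):
    """Collects the src attribute of every <img> tag (hypothetical helper)."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.sources = []

    def handle_starttag(self, tag, attrs):
        if tag != 'img':
            return
        for name, value in attrs:
            if name == 'src' and value:
                # Resolve relative and protocol-relative paths against the page URL
                self.sources.append(urljoin(self.base_url, value))


def extract_jpg_urls(html_text, base_url):
    """Return the absolute URLs of all <img> sources that end in .jpg."""
    parser = ImgSrcParser(base_url)
    parser.feed(html_text)
    return [u for u in parser.sources if u.lower().endswith('.jpg')]


# Made-up sample markup, used only for illustration
sample = ('<div><img src="//img.example.com/a.jpg">'
          '<img src="/static/b.jpg"><img src="c.png"></div>')
urls = extract_jpg_urls(sample, 'https://www.example.com/')
print(urls)
```

To use it with the script above, the bytes returned by getHtml would first need to be decoded to str, and each resulting URL could then be passed to writeFile as before.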

Reposted from www.cnblogs.com/mxh-java/p/10747808.html
