python第一个爬虫程序 - 代码天地

python第一个爬虫程序

其他 2018-11-26 14:10:52 阅读次数: 0

转载https://www.cnblogs.com/Axi8/p/5757270.html

把python2的部分改成python3了，爬取百度贴吧某帖子内的图片。

    #coding:utf-8
    import urllib.request#python3
    import re
    
    def get_html(url):
        page = urllib.request.urlopen(url)#打开网页
        html = page.read()#读取页面源码
        #html = html.decode(encoding='UTF-8')#python3
        html=html.decode('utf-8')#python3
        return html
        
    
    reg = r'src="(.+?\.jpg)" width'#正则表达式
    reg_img = re.compile(reg)#编译一下，运行更快
    imglist = reg_img.findall(get_html('http://tieba.baidu.com/p/1753935195'))#进行匹配
    x = 0
    for img in imglist:
        urllib.request.urlretrieve(img,'%s.jpg'% x)
        x += 1

猜你喜欢

转载自blog.csdn.net/qq_36616602/article/details/84062008

python篇-第一个爬虫程序

python第一个爬虫程序

第一个python程序：爬虫下载课件

第一个Python爬虫

Python 第一个爬虫

python第一个爬虫

python 爬虫《百炼成佛》爬虫入门（爬虫介绍）第一个爬虫程序

Python爬虫1：爬虫原理、网页构造与第一个爬虫程序

第一Python第一个爬虫项目

第一个python程序

python的第一个程序

python第一个程序

第一个 Python 程序

# 第一个 Python 程序

python 第一个程序】

【第一个Python程序】

Python爬虫入门——2. 1 我的第一个爬虫程序

Python网络爬虫学习笔记——第一个爬虫程序

Python爬虫之第一个爬虫

python爬虫1：第一个爬虫

我的第一个成功的爬虫程序

重写第一个爬虫程序

第一个get请求的爬虫程序

纪念跑通的第一个爬虫程序

我的第一个使用python写的爬虫程序

用Python第一个爬虫程序（urllib.request)

第一个Python程序与第一个Golang程序

python-入门的第一个爬虫例子

python第一个爬虫脚本

我的第一个python爬虫

今日推荐

NetBSD 禁止提交由 AI 生成的代码

Apache Doris 2.0.10 版本正式发布！

开源日报 | 大模型开战；大模型独角兽被曝卖身；周鸿祎建议谷歌开源所有产品；最大开源AI社区提供1000万美元共享GPU

开源日报 | Chrome内置Gemini的意义不在于Gemini；中国AI追随之路的五大误区；ECharts创始人“下海”养鱼；谷歌I/O开发者大会什么都有，只是没有惊喜

微软回应中国区AI团队“打包赴美”传闻

基于大语言模型的开源知识库问答系统 MaxKB GitHub Star 数量突破 5,000 个！

周排行

static方法和非static方法的区别（java）

如何查找计算机专业paper

java.lang.ClassFormatError: Incompatible magic value 0 in class file com/sitecha

跳跃游戏II

stm32_之【建立工程】

TeaWeb v0.0.9 发布，统计底层优化、主机监控功能改进

事件分发 -----控制字体大小

JavaScript DOM练习（动态表格添加） December 25，2019

JSF Scope & CDI

实现从零搭建一个登录注册页面（附源代码）

每日归档

更多

2024-05-19(0)

2024-05-18(4)

2024-05-17(34)

2024-05-16(6)

2024-05-15(24)

2024-05-14(0)

2024-05-13(18)

2024-05-12(0)

2024-05-11(38)

2024-05-10(38)