爬虫爬妹子图

其他 2019-10-13 12:21:13 阅读次数: 0

代码，待优化

import requests
from bs4 import BeautifulSoup

url = "http://di81.com/PicList?pageindex=1"
response = requests.get(url)
response.encoding = 'utf8'
soup = BeautifulSoup(response.text, 'html.parser')
ul = soup.find(name="ul", attrs={'id': 'pins'})
a_list = ul.find_all(name='a')

for a in a_list:
    img = a.find('img')
    if img:
        continue
    href = a.attrs.get('href')
    title = a.text
    img_url = 'http://di81.com' + href
    img_response = requests.get(img_url)
    soup2 = BeautifulSoup(img_response.text, 'html.parser')
    div = soup2.find(name='div', attrs={'class': 'main-image'})
    img2_list = div.find_all('img')
    for img2 in img2_list:
        src2 = img2.attrs.get('src')
        img_src = "http://di81.com" + src2
        src_response = requests.get(img_src)
        img_con = src_response.content
        path = r'D:\05_Code\SpiderTest\img'
        file_name = img_src.rsplit('/', maxsplit=1)[1]
        file_path = path + "\\" + title + file_name
        with open(file_path, 'wb') as f:
            f.write(img_con)
        print(file_path)

猜你喜欢

转载自www.cnblogs.com/0bug/p/11665807.html

爬虫爬妹子图

python爬虫-爬妹子图

爬虫练习--爬妹子图

爬虫爬取清纯妹子图

爬妹子图的爬虫小程序

[python爬虫]爬取妹子图

Python爬虫教程：爬取妹子图

爬虫--多进程爬取妹子图

爬虫--lxml爬取妹子图

python爬虫——爬取妹子图

萌新爬虫的动力就是爬取妹子图！批量爬取妹子图哟！

Node.js爬取妹子图-crawler爬虫的使用

爬虫之煎蛋网妹子图大爬哦

python 爬虫爬取煎蛋网妹子图

Python 爬虫入门(二)——爬取妹子图

[python爬虫] 使用多进程爬取妹子图

Python爬虫——利用requests模块爬取妹子图

Python 爬虫入门之爬取妹子图

多线程爬取妹子图 python 爬虫

多线程爬虫爬取妹子图网站

Python爬虫入门教程：爬取妹子图网站

Python爬虫入门【2】：妹子图网站爬取

Python之Scrapy爬虫实战--爬取妹子图

Python 爬虫（清纯）妹子图爬取（代码自由奔放）

python爬虫正则表达式爬妹子图

python爬虫30秒爬取1000张妹子图

python爬虫学习（九）妹子图分页爬取

【Python爬虫】使用代理爬取妹子图

爬取妹子图

妹子图爬虫

今日推荐

《美国对全球网络空间安全与发展的威胁和破坏》报告发布

火速冲上 GitHub 热榜 —— 开源编程语言、框架哪有这么可爱？

北京人形机器人创新中心发布全球首个纯电驱拟人奔跑的全尺寸人形机器人“天工”

LFOSSA 源来如此公开课 | 掌握云原生未来：CNCF 认证全面攻略与备考秘籍

国产云输入法——仅华为无云端数据上传安全问题

周排行

Python环境安装与基础语法（1）——计算机基础知识

IMU预积分

ADAS中的LDW、FCW、BSD、LCA、ACC、AEB、APA、DMS代表的含义

B站笔试两道题

skyeye arm 硬件虚拟机环境的搭建

Web前端静态页面示例

数组-合并排序数组 II-简单

springcloud之版本问题启动报错

面向对象-------------匿名对象(六)

输入URL到页面呈现中间发生了什么？

每日归档

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)

2024-04-24(36)

2024-04-23(26)

2024-04-22(39)

2024-04-21(0)