Python web crawler study notes (8): batch-downloading images with regular expressions

Others · 2020-03-26 15:22:55

I spent quite a while on the regular-expression part, so here is a small attempt at using it.

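# -*- coding: utf-8 -*-
import os
import re
from urllib.parse import urljoin

import requests

if __name__ == '__main__':
    # Create a folder to hold all downloaded images
    if not os.path.exists('./MMLibs'):
        os.mkdir('./MMLibs')

    headers = {
        'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/76.0.3809.87 Safari/537.36'
    }

    url = 'https://www.2717.com/tag/434.html'
    # General-purpose crawl: fetch the whole page behind the url
    page_text = requests.get(url=url, headers=headers).text
    # Focused crawl: pull out only the image addresses
    # Regular expression capturing the src of the <img> inside each <li>
    ex = '<li>.*?<img.*?src="(.*?)".*?</li>'
    # re.S lets '.' match newlines too; re.M makes ^ and $ match at every line
    img_src_list = re.findall(ex, page_text, re.S)
    # print(img_src_list)
    for src in img_src_list:
        # Resolve relative or protocol-relative src values against the page url
        img_url = urljoin(url, src)
        img_data = requests.get(url=img_url, headers=headers).content
        # Use the last path segment as the image file name
        img_name = src.split('/')[-1]
        # Path where the image is saved
        imgPath = './MMLibs/' + img_name
        with open(imgPath, 'wb') as fp:
            fp.write(img_data)
            print(img_name, "downloaded successfully")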
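
To see what the pattern and the re.S flag do without touching the network, here is a minimal sketch run against a made-up HTML fragment. The sample markup and the example.com addresses are assumptions for illustration only; the real page may be structured differently.

```python
import re

# Hypothetical HTML fragment; the real page markup may differ
sample_html = """<ul>
<li>
  <a href="/a/1.html"><img alt="one"
     src="https://example.com/img/001.jpg"></a>
</li>
<li><a href="/a/2.html"><img src="https://example.com/img/002.jpg"></a></li>
</ul>"""

# Same pattern as in the script above
ex = '<li>.*?<img.*?src="(.*?)".*?</li>'

# Without re.S, '.' stops at newlines, so the <li> spread over several lines is missed
without_flag = re.findall(ex, sample_html)
# With re.S, '.' also matches newlines, so both items are found
with_flag = re.findall(ex, sample_html, re.S)

print(without_flag)
print(with_flag)

# The file name is taken from the last path segment, as in the script
file_names = [src.split('/')[-1] for src in with_flag]
print(file_names)
```

If the list printed for the real page comes back empty, comparing the results with and without re.S in this way is a quick first check.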

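Python folder operations
Find the current path and create a new folder there to store the conversion output.

A folder path can be relative to the current directory, such as './file1', or absolute, starting with '/', such as '/file1'. Note that os.getcwd() returns a path without a trailing separator, so a folder name should be joined onto it with os.path.join rather than with '+'.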

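import os

file_path = os.getcwd()     # current working directory
file_name = "pinyin"        # name of the new folder
# Build the full path with os.path.join: getcwd() has no trailing separator,
# so concatenating "./pinyin" with '+' would produce an invalid path
target = os.path.join(file_path, file_name)
isExists = os.path.exists(target)   # check whether the folder already exists
if isExists:
    print(target + " already exists")
else:
    os.mkdir(target)
    print(target + " created successfully")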
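
As an alternative to checking os.path.exists before calling os.mkdir, the standard library's os.makedirs accepts exist_ok=True and does nothing when the folder is already present. The sketch below is one possible way to wrap that; the helper name ensure_folder is hypothetical, and a temporary directory stands in for the current working directory so the example leaves nothing behind.

```python
import os
import tempfile


def ensure_folder(base_dir, folder_name):
    """Create base_dir/folder_name if needed and return its full path."""
    target = os.path.join(base_dir, folder_name)
    # exist_ok=True makes the call a no-op when the folder is already there
    os.makedirs(target, exist_ok=True)
    return target


# A temporary directory stands in for os.getcwd() in this sketch
with tempfile.TemporaryDirectory() as base:
    first = ensure_folder(base, "pinyin")
    second = ensure_folder(base, "pinyin")   # calling it twice does not raise
    created = os.path.isdir(first)
    print(first == second, created)
```

This also avoids the small window between the existence check and the mkdir call in which another process could create the folder first.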


Origin blog.csdn.net/haimian_baba/article/details/103732703
