Scrapy: overriding the start_requests() method

I wanted to crawl a site's news in batches. The pagination URL is simple: just adjust the p parameter in the link, so the URL list can be written like this:

urls = ['https://www.baidu.com/p=%s' % i for i in range(1, 11)]

However, the default parse() flow, where response.follow() submits the next link after each page is parsed and the next callback then runs, doesn't fit this case: all the URLs are already known before any page is fetched.

So I overrode the start_requests() method:

import scrapy
from scrapy import Request
from article.items import ArticleItem


class XinwenSpider(scrapy.Spider):
    name = 'xinwen'
    allowed_domains = ['www.hbskzy.cn']

    def start_requests(self):
        # Build the paginated listing URLs up front and route each
        # response to next_parse() instead of the default parse().
        urls = ['http://www.hbswkj.com/index_list.jsp?a1032t=44&a1032p=%s&a1032c=20&urltype=tree.TreeTempUrl&wbtreeid=1021' % i for i in range(2, 5)]
        for url in urls:
            yield Request(url=url, callback=self.next_parse)

    def next_parse(self, response):
        # Extraction logic goes here: build and yield ArticleItem
        # objects from each listing page.
        pass
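
With start_requests() defined, a start_urls list is no longer needed; the spider is launched with the standard Scrapy command line from the project directory:

scrapy crawl xinwen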

response.follow() works on links taken from a parsed response, so it suits the case where the next page's URL is only known after the current page has been parsed. In that situation response.follow() is the right choice; applied to a scenario like this one, where every URL can be generated in advance, it is extremely error-prone.
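
For contrast, here is a minimal sketch of the response.follow() pattern; the start URL and CSS selectors are illustrative assumptions, not taken from the site above:

import scrapy


class NextPageSpider(scrapy.Spider):
    name = 'nextpage_demo'                       # hypothetical spider
    start_urls = ['https://example.com/news']    # illustrative URL

    def parse(self, response):
        # Extract items from the current listing page (selector assumed).
        for title in response.css('li.news a::text').getall():
            yield {'title': title}
        # The next page's URL is only discovered by parsing this page,
        # so response.follow() is the right tool: it resolves the
        # (possibly relative) href and schedules it with this same callback.
        next_page = response.css('a.next::attr(href)').get()
        if next_page is not None:
            yield response.follow(next_page, callback=self.parse)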


Original post: blog.csdn.net/qq_17802895/article/details/108545617