Scrapy example

start.py

# Run the spider from a script (equivalent to `scrapy crawl jokespider` on the command line)
from scrapy.cmdline import execute

execute(['scrapy', 'crawl', 'jokespider'])
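
The same crawl can also be launched without going through scrapy.cmdline, by driving Scrapy's CrawlerProcess directly. This is a minimal sketch, assuming the script sits in the project root so that get_project_settings() can find settings.py:

from scrapy.crawler import CrawlerProcess
from scrapy.utils.project import get_project_settings

process = CrawlerProcess(get_project_settings())  # load the project's settings.py
process.crawl('jokespider')                       # schedule the spider by its name
process.start()                                   # block until the crawl finishes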


items.py

import scrapy

class JokejiItem(scrapy.Item):
    # one joke detail page
    title = scrapy.Field()
    url = scrapy.Field()

class ListItem(scrapy.Item):
    # one list (index) page
    title = scrapy.Field()
    url = scrapy.Field()
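
Item subclasses behave like dicts whose keys are restricted to the declared Fields. A quick illustration (the values here are made up):

from jokeji.items import JokejiItem

item = JokejiItem(title='some joke', url='http://www.zizi.cn/...')
print(item['title'])    # dict-style access to a declared field
# item['author'] = 'x'  # would raise KeyError, because 'author' was never declared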


spider.py

from scrapy.linkextractors import LinkExtractor
from scrapy.spiders import CrawlSpider, Rule
from jokeji.items import JokejiItem, ListItem

class JokespiderSpider(CrawlSpider):
    name = 'jokespider'
    allowed_domains = ['zizi.cn']
    start_urls = ['http://www.zizi.cn']

    rules = [
        # list/index pages: send them to parse_list and keep following their links
        Rule(LinkExtractor(allow=r'/list\w+\.htm'), callback='parse_list', follow=True),
        # joke detail pages: send them to parse_item, skipping anything under /list
        Rule(LinkExtractor(allow=r'/jokehtml/\w+/\d+\.htm', deny=(r'/list',)), callback='parse_item', follow=True),
    ]

    def parse_item(self, response):
        # placeholder value; real extraction is left out in this example
        item = JokejiItem()
        item['title'] = 'from content'
        return item

    def parse_list(self, response):
        item = ListItem()
        item['url'] = "from list........" + response.url
        return item
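
parse_item above only stores a placeholder string; a minimal sketch of real extraction could look like this, as a drop-in replacement inside JokespiderSpider (the CSS selector is an assumption, since the actual page markup is not shown):

    def parse_item(self, response):
        item = JokejiItem()
        # 'title::text' is a hypothetical selector; adjust it to the real page structure
        item['title'] = response.css('title::text').get()
        item['url'] = response.url
        yield item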


pipelines.py

class JokejiPipeline(object):
    def process_item(self, item, spider):
        # just print every item the spiders return; a real pipeline would clean or store it
        print(item)
        return item  # pass the item on to any later pipelines
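
The pipeline only runs if it is enabled in settings.py. Assuming the default project layout (project package jokeji), the entry would be:

ITEM_PIPELINES = {
    'jokeji.pipelines.JokejiPipeline': 300,  # lower number = earlier in the pipeline chain
}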



Reposted from www.cnblogs.com/pythonClub/p/9841509.html