Scrapy command-line tool - Code World

Scrapy command-line tool

Others 2019-08-02 14:00:31 views: null

The default project structure 1.Scrapy

scrapy.cfg
myproject/
    __init__.py
    items.py
    middlewares.py
    pipelines.py
    settings.py
    spiders/
        __init__.py
        spider1.py
        spider2.py
        ...

scrapy.cfgThere lies the root directory of the project. This file contains a description of the configuration file.

[settings]
default = myproject1.settings
project1 = myproject1.settings
project2 = myproject2.settings

It will be used by default defaultthis configuration. You can use SCRAPY_PROJECTenvironment variables to specify a different projects.

export SCRAPY_PROJECT=project2
scrapy settings --get BOT_NAME

Use 2.scrapy tools

Run directly help explain:

scrapy

Create a project:

scrapy startproject myproject [project_dir]

Control Project:

scrapy genspider mydomain mydomain.com  # 创建一个新的Spider

Tools available commands:

scrapy -h  # 查看所有可用命令
scrapy <command> -h  # 查看对应命令的帮助

全局命令：
startproject：scrapy startproject <project_name> [project_dir]  # 创建项目
genspider：scrapy genspider [-t template] <name> <domain>  # 创建新的Spider
settings：scrapy settings [options]  # 获取配置
runspider：scrapy runspider <spider_file.py>  # 运行Python文件里的Spider，不需要创建项目
shell：scrapy shell [url]  # 启动Scrapy交互终端，可用于调试
fetch：scrapy fetch <url>  # 使用Scrapy下载器下载给定的URL，并将获取的内容送到标准输出
view：scrapy view <url>  # 浏览器中打开URL
version：scrapy version [-v]  # 打印版本

项目命令：
crawl：scrapy crawl <spider>  # 使用spider进行爬取
check：scrapy check [-l] <spider>  # 运行contract检查
list：scrapy list  # 列出当前项目可用的所有spider
edit：scrapy edit <spider>  # 使用EDITOR环境变量中设定的编辑器编辑spider
parse：scrapy parse <url> [options]  # 获取给定的URL并使用spider分析处理
bench：scrapy bench  # 运行benchmark测试

Guess you like

Origin www.cnblogs.com/Ooman/p/11223783.html

Scrapy command-line tool

SpringBootCLI command-line tool

PowerCMD - cmd command-line tool

Fast write node command-line tool

Chapter 3: create command-line tool

Windows command-line tool to configure cmder

java command-line tool package

node write command-line tool

Twelve, ARP spoofing command-line tool

Redis command-line tool can be used so you know it?

k8s command-line tool - kubectl

With Python argv and input () command-line tool production

Python command-line parameter analysis tool argparse

Python magnetic obtain command-line tool torrent-cli

Check the window 10 at the specified command-line tool is located

HTTPie 2.0.0 release, HTTP command-line tool bag

ROS practice manual (two) command-line tool

Heavy! The new official GitHub open source command-line tool

Written using .Net Core command-line tool (CLI)

[Amad] cookiecutter - a command-line tool to build the project using a project template

Laravel command-line tool as much as thread synchronization large quantities of data DB connection confusion Solutions

Using command-line tool UCloud CLI, lightweight operating cloud resource management API

PathMarker: command-line tool to quickly edit jump (with git, find, etc.)

Fishing in troubled waters but also have the skills, 3 linux command-line tool lets you pretend busy

Front-end test automation jest Lesson 3 command-line tool

snzip - hadoop-snappy unzip files on the Linux command-line tool

Linux-Terminator, a command-line tool that can split the terminal screen

gops: a command-line tool to list and diagnose running Go processes on your system

xterm.js is a command-line tool for linking containers based on websockets

Shutdown an OSGi container after some code has been executed (to create a command-line tool)

Recommended

Domestic cloud input method - only Huawei has no cloud data upload security issues

Open Source Daily | Industrial open source project OGG 1.0; Sister, do you want to configure Firefox with me? Apple AI is far behind? Fedora 40

Ranking

Deploy Tomcat (Web) services Comments

jquery: event mechanism bind() on()

The simple and easy-to-understand basic encapsulation module makes Web testing easier!

Oracle Database on Linux solutions Chinese garbled

Talk about the REST API of Eureka Server

Instructions for companies applying for ISO certification

Spring mvc framework adds encryption/decryption of messages

Transport layer protocol-introduction of TCP protocol and the process of three-way handshake and four-time disconnection, and the difference with UDP protocol

MockMvc and Mockito - java.lang.AssertionError: JSON path "$" Expected: a collection with size <2> but: collection size was <0>

Redis cluster creation

Daily

More

2024-04-24(30)

2024-04-23(30)

2024-04-22(5)

2024-04-21(0)

2024-04-20(6)

2024-04-19(5)

2024-04-18(0)

2024-04-17(31)

2024-04-16(23)

2024-04-15(5)