Python Crawler (3) Services

Local Machine Service
Start the Service
>scrapyd
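
Once started, scrapyd listens on port 6800. A quick sanity check from Python, as a sketch: it assumes the requests library is installed and a scrapyd version recent enough to expose daemonstatus.json.

import requests

# daemonstatus.json reports the node name and how many jobs are
# pending, running and finished on this scrapyd node.
print(requests.get("http://localhost:6800/daemonstatus.json").json())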

Call the API to schedule a spider job
>curl http://localhost:6800/schedule.json -d project=default -d spider=author
{"status": "ok", "jobid": "3b9c84c28dae11e79ba4a45e60e77f99", "node_name": "ip-10-10-21-215.ec2.internal"}

More API endpoints
http://scrapyd.readthedocs.io/en/stable/api.html#api
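
Two endpoints from that page are especially useful day to day: listjobs.json to poll job state and cancel.json to stop a run. A sketch under the same assumptions as above (requests installed, scrapyd on localhost):

import requests

# listjobs.json lists the pending, running and finished jobs
# of a project.
jobs = requests.get(
    "http://localhost:6800/listjobs.json",
    params={"project": "default"},
).json()
print(jobs["running"])

# cancel.json stops a job; the job value is the jobid returned
# by schedule.json (the one from above is reused here).
requests.post(
    "http://localhost:6800/cancel.json",
    data={"project": "default", "job": "3b9c84c28dae11e79ba4a45e60e77f99"},
)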

Call to Pass a Setting and a Spider Argument
>curl http://localhost:6800/schedule.json -d project=myproject -d spider=somespider -d setting=DOWNLOAD_DELAY=2 -d arg1=val1
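
On the spider side, setting=DOWNLOAD_DELAY=2 is applied as a Scrapy setting, while every other extra parameter (arg1 here) is passed through as a spider argument and becomes an attribute on the spider instance. A minimal sketch; the spider name and URL are illustrative, only the argument handling matters:

import scrapy

class SomeSpider(scrapy.Spider):
    name = "somespider"
    start_urls = ["http://quotes.toscrape.com/"]

    def __init__(self, arg1=None, *args, **kwargs):
        super().__init__(*args, **kwargs)
        # Receives "val1" when scheduled with -d arg1=val1
        self.arg1 = arg1

    def parse(self, response):
        self.logger.info("arg1 = %s", self.arg1)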

List Projects
>curl http://localhost:6800/listprojects.json
{"status": "ok", "projects": ["default", "tutorial"], "node_name": "ip-10-10-21-215.ec2.internal”}

List Spiders
>curl http://localhost:6800/listspiders.json?project=default
{"status": "ok", "spiders": ["author", "quotes"], "node_name": "ip-10-10-21-215.ec2.internal"}

Status Web UI
http://localhost:6800/

http://scrapyd.readthedocs.io/en/stable/overview.html

Clustered Solution?
https://github.com/rmax/scrapy-redis
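
scrapy-redis swaps Scrapy's per-process scheduler and duplicate filter for Redis-backed ones, so several nodes can share one request queue and one seen-URL set. A minimal settings.py sketch following the project's README; the Redis URL is an assumption for a local instance:

# settings.py of an existing Scrapy project

# Redis-backed scheduler and duplicate filter shared by all workers.
SCHEDULER = "scrapy_redis.scheduler.Scheduler"
DUPEFILTER_CLASS = "scrapy_redis.dupefilter.RFPDupeFilter"

# Keep the request queue between runs instead of clearing it on close.
SCHEDULER_PERSIST = True

# Location of the shared Redis instance (assumed local here).
REDIS_URL = "redis://localhost:6379"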


References:
http://scrapyd.readthedocs.io/en/stable/overview.html#how-scrapyd-works

Reposted from sillycat.iteye.com/blog/2391685