[Reptile] study notes day65 7.7 scrapy-redis combat -. Oranges distributed project 1

7.7 scrapy-redis combat -. IT oranges distributed project 1

Here Insert Picture Description
IT Orange is concerned about IT Internet industry structured corporate databases and business information services provider in the May 21, 2013 on the line.

IT Orange committed to the production of information and data aggregation, mining, processing, handling, help target users and customers save time and money, improve efficiency, to assist in its various areas of business conduct, including venture capital, acquisitions, competitive intelligence, fine sub-industry information, foreign companies product information and data services.

Used to be solely responsible for the content published or collected all disputes arising from the acquisition of content published or caused the loss, bear all direct or indirect (joint and several) liability by the content of the publication or gatherers, they do not bear oranges any legal liability.

Project acquisition Address: http://www.itjuzi.com/company

Requirements: All Companies under the startup page collection, including but not limited to the following:

# items.py

# -*- coding: utf-8 -*-
import scrapy

class CompanyItem(scrapy.Item):

    # 公司id (url数字部分)
    info_id = scrapy.Field()
    # 公司名称
    company_name = scrapy.Field()
    # 公司口号
    slogan = scrapy.Field()
    # 分类
    scope = scrapy.Field()
    # 子分类
    sub_scope = scrapy.Field()

    # 所在城市
    city = scrapy.Field()
    # 所在区域
    area = scrapy.Field()
    # 公司主页
    home_page = scrapy.Field()
    # 公司标签
    tags = scrapy.Field()

    # 公司简介
    company_intro = scrapy.Field()
    # 公司全称:
    company_full_name = scrapy.Field()
    # 成立时间:
    found_time = scrapy.Field()
    # 公司规模:
    company_size = scrapy.Field()
    # 运营状态
    company_status = scrapy.Field()

    # 投资情况列表:包含获投时间、融资阶段、融资金额、投资公司
    tz_info = scrapy.Field()
    # 团队信息列表:包含成员姓名、成员职称、成员介绍
    tm_info = scrapy.Field()
    # 产品信息列表:包含产品名称、产品类型、产品介绍
    pdt_info = scrapy.Field()
Published 289 original articles · won praise 94 · views 10000 +

Guess you like

Origin blog.csdn.net/qq_35456045/article/details/104111518