Scrapy爬取全网小说到本地TXT，Python少年最爱的一个爬虫项目！

其他 2018-05-08 23:00:00 阅读次数: 4

scrapy，写了一个简单的python爬虫项目，功能是采集某小说网站的全部小说，保存到本地

送给刚刚学习scrapy的python朋友学习。

Scrapy爬取全网小说到本地TXT，Python少年最爱的一个爬虫项目！

部分Python代码：

# -*- coding: utf-8 -*-

# Define your item pipelines here

# Don't forget to add your pipeline to the ITEM_PIPELINES setting

# See: https://doc.scrapy.org/en/latest/topics/item-pipeline.html

import os

class BiqugePipeline(object):

def process_item(self, item, spider):

#return item

curPath = 'E:/小说/'

tempPath = str(item['name'])

targetPath = curPath+ tempPath

#print('-----')

#print(targetPath)

if not os.path.exists(targetPath):

os.makedirs(targetPath)

filename_path = targetPath+'/'+ str(item['chapter_name']) + '.txt'

print('------')

print(filename_path)

print(item['chapter_content'])

with open(filename_path, 'a', encoding='utf-8') as f:

f.writelines(item['chapter_content'])

return item

猜你喜欢

转载自blog.csdn.net/qq_41841569/article/details/80225544

Scrapy爬取全网小说到本地TXT，Python少年最爱的一个爬虫项目！

Python爬虫层层递进，从爬取一章小说到爬取全站小说

如何用python爬虫从爬取一章小说到爬取全站小说

python 爬取整本小说到本地文件

五分钟写一个小爬虫，爬取小说并写入txt文件

【Python爬虫】轻松几步将一个 scrapy项目变成 scrapy_redis 分布式爬取

一个爬虫从网页中爬取小说

Python笔记（五） --写一个爬虫对新笔趣阁的小说进行爬取

scrapy爬取小说(一）

Python爬虫入门实战系列（一）--爬取网络小说并存放至txt文件

一个简单的使用scrapy爬取小说的例

Scrapy 学习笔记 - 一个练手任务，爬取起点的全部小说名

python爬虫五：爬取小说，下载到本地

我的第一个python爬虫程序——爬取网络小说（含错误及源码）

Python爬虫——爬取小说

Python爬虫之Scrapy框架系列（14）——实战ZH小说爬取【多页爬取】

Python爬虫实战项目之小说信息爬取

python爬虫-利用scrapy框架完成天天书屋内容爬取，并保存本地txt

scrapy 爬取小说

scrapy爬取小说

python爬虫--一次爬取小说的尝试

python爬虫之类的方法爬取一部小说

【python实现网络爬虫（5）】第一个Scrapy爬虫实例项目（Scrapy原理及Scrapy爬取名言名句网站信息）

一个简单的爬取小说的python程序彻底搞懂Python的字符编码

python-scrapy爬取小说下载网小说

Python爬虫练习爬取网络小说保存到txt

Python爬虫实战，requests+openpyxl模块，爬取小说数据并保存txt文档（附源码）

爬虫：Scrapy爬取第一个网页实例解析

小说免费看！python爬虫框架scrapy 爬取纵横网

Python爬虫—爬取小说名著

今日推荐

中国码农的“35岁魔咒”

蘭雅 CorelDRAW 插件 2024.5.1 国际劳动节版，免费下载

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

《美国对全球网络空间安全与发展的威胁和破坏》报告发布

周排行

Java基础复习_day13_Collection集合

2018.11.16 c语言学习经验

且看Java内置四大核心函数式接口

小程序云开发中数据库的数据分段和显示图片

python的函数

Web-JS进阶

【干货】C++常用代码积累笔记大全

Spring的ioc操作与 IOC底层原理

构建之法20191121-11 Scrum立会报告+燃尽图 07

Spring boot之Hello World访问404

每日归档

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)

2024-04-27(56)

2024-04-26(39)