Scraping all the text content (yuke)


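```python
import json

import scrapy


class QiubaiSpider(scrapy.Spider):
    name = 'qiubai'
    start_urls = ['https://mp.weixin.qq.com/s/n7E2PCcXppbLMXtR_Um-FA']

    def parse(self, response):
        # Earlier, narrower selector that matched only specific paragraphs:
        # div_list = response.xpath(
        #     '//*[@id="js_content"]/section[2]/section[1]/section/section[2]/section/p')
        # Collect every text node under the article body (#js_content).
        div_list = response.xpath('//*[@id="js_content"]//text()').extract()
        print(div_list)
        # Write the list as JSON; the with block closes the file, and
        # ensure_ascii=False keeps the Chinese text readable.
        with open('./语文重点.txt', 'w', encoding='utf-8') as fp:
            json.dump(div_list, fp, ensure_ascii=False)
```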
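
The `//text()` selector returns every text node, so `div_list` will usually contain whitespace-only fragments between the real paragraphs. A minimal sketch of one way to tidy such a list before saving it follows; the helper name `clean_text_nodes` and the sample data are hypothetical and not part of the original spider.

```python
def clean_text_nodes(nodes):
    """Strip surrounding whitespace from each fragment and drop empty ones."""
    stripped = (node.strip() for node in nodes)
    return [text for text in stripped if text]


# Made-up fragments standing in for the list returned by .extract()
sample = ['\n  ', 'First paragraph', '\t', '  Second paragraph\n', '']
cleaned = clean_text_nodes(sample)
print('\n'.join(cleaned))
```

Inside `parse`, the same helper could be applied to `div_list` before the `json.dump` call, assuming the whitespace fragments are not needed.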



Reposted from blog.csdn.net/xiaoxiamimm/article/details/112425469