爬虫content-type - 代码天地

爬虫content-type

其他 2018-10-10 10:20:08 阅读次数: 0

		self.headers['referer'] = self.url_target
		# 设置content-type,可以获取数据,默认没有数据
        self.headers['content-type']="text/javascript; charset=utf-8"
        
        response = self.s.get(self.url_target + "/lists", proxies=self.proxies, headers=self.headers,
                              cookies=self.cookies_dict)
        # res_dict=json.loads(response.text)
        # html=res_dict['page']
        # print html
        res=etree.HTML(response.text)

        lists=res.xpath('//div[@class="GridTimeline-items has-items"]/div')
        list_=[]
        for l in lists:
            item={}
            item['name']=l.xpath('./div/div/a[1]/text()')[0].strip()
            item['url']="https://twitter.com"+l.xpath('./div/div/a[1]/@href')[0]
            item['builder']=l.xpath('./div/div/span/a/text()')[0].strip()
            item['members']=l.xpath('./div/div/div/p//text()')[0].strip()
            list_.append(item)
        print list_

猜你喜欢

转载自blog.csdn.net/wu0che28/article/details/82788816

爬虫content-type

Content-type

content-type 解析

content-type类型

Content-Type记录

关于content-type

Content-Type详解

Accept与Content-Type

@RequestBody与Content-type

file content-type

Accept 与 Content-Type

content-type:

HTTP content-type

HTTP - Content-Type

mime content-type

了解content-type

http之content-type

accept 与 content-type 的区别

实体首部:Content-Type

HTML Content-Type 大全

设置返回的Content-Type

PHP Content-type 的说明

response设置content-type

Content-Type的常用格式

网络请求content-type

content-type的几种取值

HTTP协议：Content-Type

content-type属性值

HttpClient的Content-Type设置

HTTP 之 Content-Type

今日推荐

《美国对全球网络空间安全与发展的威胁和破坏》报告发布

火速冲上 GitHub 热榜 —— 开源编程语言、框架哪有这么可爱？

北京人形机器人创新中心发布全球首个纯电驱拟人奔跑的全尺寸人形机器人“天工”

LFOSSA 源来如此公开课 | 掌握云原生未来：CNCF 认证全面攻略与备考秘籍

周排行

循环神经网络（rnn）讲解

Tigao教程四：单独的关节运动

金蝶K3WISE15.0-注册套打教程

如何在Mac上配置Kubernetes

Android应用结束自身进程的方法

SpringMVC学习十三拦截器栈

中国驻洛杉矶总领馆举行新春招待会

HttpClient get post 发送

11 - three.js 笔记 - 绘制三维字体模型

Mysql递归获取某个父节点下面的所有子节点和子节点上的所有父节点

每日归档

更多

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)

2024-04-24(36)

2024-04-23(26)

2024-04-22(39)