Python crawler: job postings

Just learned Selenium, so I gave Tencent's recruitment site a quick scrape.
from lxml import etree
from selenium import webdriver
import pymysql

def Geturl(fullurl):  # collect the detail-page links from one listing page
    browser.get(fullurl)
    shouye_html_text = browser.page_source
    shouye_ele = etree.HTML(shouye_html_text)
    # hrefs of every job posting in the listing table
    zp_list = shouye_ele.xpath('//*[@id="position"]/div[1]/table/tbody/tr/td/a/@href')
    zp_url_list = []
    for zp_url_part in zp_list:
        zp_url = 'https://hr.tencent.com/' + zp_url_part
        zp_url_list.append(zp_url)
    return zp_url_list

def Getinfo(zp_url_list):  # scrape the fields inside each job-posting page
    for zp_url in zp_url_list:
        browser.get(zp_url)
        zp_info_html = browser.page_source
        zp_ele = etree.HTML(zp_info_html)
        zp_info_title = str(zp_ele.xpath('//*[@id="sharetitle"]/text()')[0])
        zp_info_location = str(zp_ele.xpath('//*[@id="position_detail"]/div/table/tbody/tr[2]/td[1]/text()')[0])
        zp_info_type = str(zp_ele.xpath('//*[@id="position_detail"]/div/table/tbody/tr[2]/td[2]/text()')[0])
        zp_info_num = str(zp_ele.xpath('//*[@id="position_detail"]/div/table/tbody/tr[2]/td[3]/text()')[0])
        zp_info_need = str(zp_ele.xpath('//*[@id="position_detail"]/div/table/tbody/tr[3]/td/ul/li/text()'))
        connection = pymysql.connect(host='localhost', user='root', password='1234', db='txzp')
        try:
            with connection.cursor() as cursor:
                sql = "INSERT INTO `txzp_info` (`title`, `location`, `type`, `num`, `need`) VALUES (%s, %s, %s, %s, %s)"
                cursor.execute(sql, (zp_info_title, zp_info_location, zp_info_type, zp_info_num, zp_info_need))
            connection.commit()
        finally:
            connection.close()
        print(zp_info_title, zp_info_location, zp_info_type, zp_info_num, zp_info_need)

if __name__ == '__main__':
    browser = webdriver.Chrome()
    pages = int(input('How many pages? '))
    for i in range(pages):
        url = 'https://hr.tencent.com/position.php?keywords=&tid=0&start={}'
        fullurl = url.format(str(i * 10))  # each listing page shows 10 postings
        zp_url_list = Geturl(fullurl)
        Getinfo(zp_url_list)
    browser.close()
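
The script assumes a `txzp` database with a `txzp_info` table already exists. Below is a minimal one-off setup sketch; the column names come from the INSERT statement above, but the column types and character set are my assumptions, not from the original post.

import pymysql

# Setup sketch: create the `txzp_info` table the crawler inserts into.
# Column names match the INSERT above; the types are assumptions.
connection = pymysql.connect(host='localhost', user='root', password='1234', db='txzp')
try:
    with connection.cursor() as cursor:
        cursor.execute("""
            CREATE TABLE IF NOT EXISTS `txzp_info` (
                `id` INT AUTO_INCREMENT PRIMARY KEY,
                `title` VARCHAR(255),
                `location` VARCHAR(255),
                `type` VARCHAR(255),
                `num` VARCHAR(64),
                `need` TEXT
            ) CHARACTER SET utf8mb4
        """)
    connection.commit()
finally:
    connection.close()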
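
One fragile spot: `page_source` is read immediately after `browser.get()`, so if the listing table renders slowly the XPath can come back empty. A hedged sketch of an explicit wait, which is not in the original script, that blocks until the `#position` container is present before reading the HTML:

from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC

# Sketch: wait up to 10 seconds for the listing container before reading page_source.
# The element id "position" matches the XPath used in Geturl(); the rest is illustrative.
browser = webdriver.Chrome()
browser.get('https://hr.tencent.com/position.php?keywords=&tid=0&start=0')
WebDriverWait(browser, 10).until(
    EC.presence_of_element_located((By.ID, 'position'))
)
html_text = browser.page_source
browser.quit()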

Reposted from www.cnblogs.com/pantom0122/p/9501578.html