Scraping data from the Shixiseng website with Python

Other 2018-09-22 14:21:11

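I have been looking for an internship lately, so I decided to use Python to scrape some job listings and see which positions are most in demand.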

# -*- coding: utf-8 -*-

import re

import requests
import xlwt
from bs4 import BeautifulSoup

book = xlwt.Workbook()
# Create the worksheet
sheet = book.add_sheet('sheet1', cell_overwrite_ok=True)


def getHtml():
    url = 'http://www.shixiseng.com/interns?p='
    response = requests.get(url=url)
    html = response.content  # raw page source
    soup = BeautifulSoup(html, 'html.parser')  # parse the source
    # Work out how many pages the listing has from the last pager link
    page = soup.select('div#pagebar')[0]
    href = str(page.select('li')[-1].a.attrs['href'])
    # The first run of digits in the href is taken as the last page number
    match = re.search(r'\d+', href)
    lastpage = int(match.group())
    print(lastpage)
    # range() excludes its upper bound, so pass lastpage + 1
    saveData(url, lastpage + 1)


def saveData(url, lastpage):
    # Next free row in the sheet; set once, outside the page loop,
    # so rows keep accumulating across pages
    row = 0
    for i in range(1, lastpage):
        html = requests.get(url='%s%d' % (url, i)).content
        soup = BeautifulSoup(html, 'html.parser')
        infos = soup.select('div.posi-list')[0].select('div.list')
        # Fields of each listing
        for info in infos:
            po_name = info.select('div.names.cutom_font')[0].a.text
            part = info.find('a', class_='cutom_font').text
            addr = info.find('div', class_='addr').span.text
            xz = info.find('div', class_='xz').span.text

            # Write one row per listing to the Excel sheet
            sheet.write(row, 0, po_name)
            sheet.write(row, 1, part)
            sheet.write(row, 2, addr)
            sheet.write(row, 3, xz)
            row += 1


if __name__ == '__main__':
    getHtml()
    book.save('shixiseng.xls')
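
The page-count step in getHtml() reads the last page number out of the final pager link's href. Below is a minimal sketch of that extraction on its own, so it can be checked without hitting the site. It assumes the pager link carries the number in a `p=` query parameter, as the listing URL in the script does; the pager's real markup is not confirmed here. The helper name `last_page_from_href` and the sample hrefs are hypothetical.

```python
import re


def last_page_from_href(href):
    """Return the page number found in a pager link, or None if absent.

    Assumption: the number is carried in a `p=` query parameter.
    """
    match = re.search(r'[?&]p=(\d+)', href)
    return int(match.group(1)) if match else None


# Hypothetical hrefs, made up for illustration only
print(last_page_from_href('/interns?p=498'))
print(last_page_from_href('/interns?k=python&p=7'))
print(last_page_from_href('/interns'))
```

Returning None when nothing matches lets the caller decide whether to stop or fall back to a single page, instead of failing with an AttributeError on `match.group()`.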

Reposted from blog.csdn.net/heavenmark/article/details/73716736
