Scraping the Zuihaodaxue (Best Chinese Universities) Ranking with a Python Crawler - 代码天地

Category: Other | 2020-02-12 23:50:47

#-*- coding:utf-8 -*-
# Author: JamesBen
# Email: [email protected]

import requests
from bs4 import BeautifulSoup
import bs4

# Fetch the page source; return an empty string if the request fails.
def Get_HTML(url):
    try:
        use = {'User-Agent': 'Mozilla/5.0'}  # Send a browser-like User-Agent, since some sites block the default one
        r = requests.get(url, timeout=30, headers=use)
        r.raise_for_status()  # Raise HTTPError for 4xx/5xx responses
        r.encoding = r.apparent_encoding  # Decode with the encoding detected from the body to avoid garbled text
        return r.text
    except requests.RequestException:
        return ""

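# Collect rank, university name, and total score from each <tr> in the table body.
def U_list(ulist, html):
    soup = BeautifulSoup(html, "html.parser")
    tbody = soup.find("tbody")
    if tbody is None:  # The fetch failed or the page layout changed
        return
    for tr in tbody.children:
        if isinstance(tr, bs4.element.Tag):  # Skip non-Tag children such as whitespace strings
            tds = tr("td")
            ulist.append([tds[0].string, tds[1].string, tds[3].string])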

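# Print the first num rows as a centered, tab-separated table.
def print_Univlist(ulist, num):
    # Field 1 is padded with chr(12288), the full-width space, so Chinese names line up
    tplt = "{0:^10}\t{1:{3}^10}\t{2:^10}"
    print(tplt.format("排名", "学校名称", "总分", chr(12288)))  # Header: rank, university name, total score
    rows = ulist[:num]  # Slicing avoids an IndexError when fewer than num rows were parsed
    for u in rows:
        print(tplt.format(u[0], u[1], u[2], chr(12288)))
    print("Suc" + str(len(rows)))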

def main():
    uinfo = []
    url = 'http://www.zuihaodaxue.com/zuihaodaxuepaiming2019.html'
    html = Get_HTML(url)
    U_list(uinfo, html)
    print_Univlist(uinfo, 20)


if __name__ == "__main__":
    main()
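
The template in print_Univlist uses a nested format field, `{1:{3}^10}`, so the fill character for the name column comes from the fourth argument. Below is a minimal sketch of that idea on its own, using only the standard library. The helper name `format_row`, the keyword names `fill` and `w`, and the two sample rows are assumptions made for illustration. They are placeholders, not real ranking data.

```python
FULLWIDTH_SPACE = chr(12288)  # U+3000, the ideographic (full-width) space


def format_row(rank, name, score, width=10):
    """Center each field in `width` characters; pad the name with U+3000.

    Hypothetical helper: it mirrors the template in print_Univlist but
    passes the fill character and width by keyword.
    """
    template = "{0:^{w}}\t{1:{fill}^{w}}\t{2:^{w}}"
    return template.format(rank, name, score, fill=FULLWIDTH_SPACE, w=width)


# Placeholder rows for illustration only, not real ranking data.
rows = [("1", "甲大学", "90.0"), ("2", "乙丙丁大学", "80.0")]
for row in rows:
    print(format_row(*row))
```

Padding with an ordinary space would give every field the same character count too, but in most monospaced fonts a CJK character is about twice as wide as an ASCII space, so the columns would drift. Filling the name column with the full-width space tends to keep the padding the same visual width as the characters it surrounds.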

Reposted from www.cnblogs.com/James-Ben/p/12301783.html
