# day03 爬取豌豆荚 (scraping the Wandoujia app store)

from bs4 import BeautifulSoup
import requests
# Request URL: https://www.wandoujia.com/category/6001
# Request method: GET

def have_title(tag):
    """Predicate for BeautifulSoup's find(): match a <span> tag that carries a title attribute.

    Returns the tag itself on a match (truthy), None otherwise.
    """
    is_titled_span = tag.name == 'span' and tag.has_attr("title")
    return tag if is_titled_span else None

# Fetch the page.
def get_page(url):
    """Issue a GET request for *url* and return the raw requests.Response."""
    return requests.get(url)
# Parse the page.
def parse_detail(html):
    """Parse the Wandoujia category page and return a formatted summary string.

    Extracts, for every <li class="card"> app card: app name, detail-page URL,
    install count, and app size, and concatenates one text record per app.

    Fixes over the original:
    - no longer shadows the builtin ``list``
    - builds the result with a list + ``"".join`` instead of quadratic ``+=``
    - looks up the shared ``div.meta`` node once per card instead of twice
    """
    soup = BeautifulSoup(html, 'lxml')
    cards = soup.find_all(name='li', class_='card')

    records = []
    for card in cards:
        app_name = card.a.img.attrs['alt']
        detail_url = card.a.attrs['href']
        # Hoist the shared parent lookup: both fields live under div.meta.
        meta = card.find(name='div', class_='meta')
        download_num = meta.find(class_='install-count').text
        app_size = meta.find(have_title).text
        records.append(f"""
              名称      : {app_name}
              详情页url : {detail_url}
              下载人数  : {download_num}
               app大小  : {app_size}
                
                """)
    return "".join(records)

# Save the data.
def save_games(data):
    """Write the scraped summary string to games.txt (UTF-8), overwriting any previous file."""
    with open('games.txt', 'w', encoding='utf-8') as fh:
        print(data, file=fh, end='')

if __name__ == '__main__':
    url = 'https://www.wandoujia.com/category/6001'
    # Reuse the get_page() helper instead of duplicating the requests.get call
    # (the original left get_page defined but never used).
    index_res = get_page(url)
    index_detail = index_res.text
    data = parse_detail(index_detail)
    save_games(data)

# 猜你喜欢 (You may also like)
#
# Reposted from www.cnblogs.com/zzf0601/p/11129414.html