Python学习(爬取信息1)

0. 前言

本部分为爬虫自学教程

1. 代码部分

1、爬取信息(学习)

import requests(模块)
import bs4(未安装模块)
res=requests.get("https://movie.douban.com/top250")
soup=bs4.BeautifulSoup(res.text,"html.parser")
targets=soup.find_all("div",class_="hd")
for each in targets:
    print(each.a.span.text)
结果未出来

2、参考网站 https://ilovefishc.com/dvd/

3、学习爬取豆瓣电影信息

import requests
from bs4 import BeautifulSoup

for i in range (0,10):
    url = "https://movie.douban.com/top250?start="+(str(i*25))
    #获取网页
    response = requests.get(url)
    #解析网页
    soup = BeautifulSoup(response.text,"html.parser")
    movie_list = soup.find_all(name='div',attrs={'class':'info'})
    #print(movie_list)
    print("\n"+str(i+1)+" 页:\n")
    #遍历网页信息
    for movie_information in movie_list:
        m_name = movie_information.find(name = 'span',class_ = 'title').text
        m_score = movie_information.find(name = 'span',class_='rating_num').text
        print(m_name+"            "+m_score)
发布了26 篇原创文章 · 获赞 12 · 访问量 1767

猜你喜欢

转载自blog.csdn.net/y_j_6666/article/details/104179696