Python crawler introductory combat 1: Obtaining CSDN personal blog article directory and reading data

☞ ░LaoYuan Python blog post directory: https://blog.csdn.net/LaoYuanPython/article/details/98245036

I. Introduction

There has been a relatively large increase in the number of visits to blogs for a while, from the conventional fluctuation range of 1000-3000, which has almost doubled, and the growth of fans has almost doubled from an average of 10-40 people per day. The following is The data graph of blog post visits and fan growth provided by csdn: The
Insert picture description here
Insert picture description here
sudden increase is unexpected, and the old ape really wants to figure out what articles these visits and fans are bringing. But I didn’t read the latest blog post, and I don’t remember whether the reading volume of the previous blog post has increased. It is very troublesome to read it by myself, because there are a lot of blog posts, so I thought that since I learned the crawler anyway, I would write a program to go to CSDN. Get and record data.

2. Background knowledge

  • In order to crawl data from CSDN, this article uses urllib.request and B commonly used by crawlers

Guess you like

Origin blog.csdn.net/LaoYuanPython/article/details/113740717