After crawling anime "addicted", I gave up my lunch break and couldn't wait to use Python to smash the data of Tencent Anime, tsk tsk tsk

This is the 10th of 120 cases of reptiles

In the process of writing this blog, Brother Zha told me that he reviewed "Under One Person" and "Supreme Pupil Master: Miss Peerless" by the way , doge.

After reading this article, you will gain

  1. 5000+ Tencent animation data;
  2. Regular expression area extraction;
  3. Multithreaded crawler.

Tencent animation data collection technology

Target data source analysis

Crawl the target website

The target website for this crawling is: https://ac.qq.com/Comic/index/page/1 .

After crawling anime "addicted", I gave up my lunch break and couldn't wait to use Python to smash the data of Tencent Anime, tsk tsk tsk
For the data in the above figure, this paper will collect the data of the frame selection area in the figure below, and this paper will use regular expressions to match the area blocks.

Guess you like

Origin blog.csdn.net/hihell/article/details/118340372