Python web crawler from 0 to 1 (0): order and directory

Python web crawler from 0 to 1 (0): order and directory

sequence

  Many people say that today, we are in an era of information explosion, surrounded by all kinds of information. From an ordinary user's point of view, the information explosion is nothing more than adding the words "big data" to the advertisements to highlight the loftiness. You can always "happen" to get the content you care about most in the recommendation column , nothing more. In the eyes of a developer, a small feature is behind thousands of lines of code, countless high-rise buildings at night with sparkling screens, and a sea of ​​knowledge in the minds of himself and his team . The web crawler, that is, a lighthouse standing on the sea, although only one of them, has its own unique light. Its presence not only appears in large projects, but also often appears in small tools. Python, as a popular "all-round" back-end language, is also often used for the development of such functions. In this series, we will start from zero, learn about web crawlers and explore the world behind those data.
  This series of content is mainly for readers who have a certain Python programming foundation (basic syntax), but do not have more systematic Web crawler-related knowledge. From the shallower to the deeper, understand the principles of Python web crawlers and apply them to actual projects. The content is hand-printed, and the samples have been verified, which is convenient for readers to deepen the impression in practice.
  The content of this series mainly refers to the team course of Teacher Song Tian of Beijing Institute of Technology, which is also the study notes of the author in a certain sense. Hope to share knowledge with everyone and make progress together.

table of Contents

Python web crawler from 0 to 1 (1): Detailed introduction to the Requests library
(this series is not over, please wait for subsequent updates)

Guess you like

Origin blog.csdn.net/Zheng__Huang/article/details/108354215