Product manager to learn Python- reptiles crucial no.1

Windows operating system

1, various tutorials - install the Python interpreter -3.7 version, IDE tools -Pycharm,

2, the basic principles of web crawler

  ① initiate a request -, starting a Resquest (request method (post / get, put / delete, etc.)) + Headers- key-value pairs;

  ② content acquisition response -Response, types include HTML / json, binary, etc.; (state of the Status Code, Response header Response Headers)

  ③ corresponding to the content parsing, using regular, page parsing library, JSON, binary object, etc.

  It ④ saves data, such as text files or databases

3、

Guess you like

Origin www.cnblogs.com/bogepm/p/11333024.html