Windows operating system
1, Setup (covered by various tutorials): install the Python interpreter (version 3.7) and an IDE (PyCharm).
2, Basic principles of a web crawler
① Initiate a request: send a Request, made up of a request method (GET / POST, PUT / DELETE, etc.) plus Headers (key-value pairs);
② Get the response: the Response carries a status code (Status Code), response headers (Response Headers), and a body whose type may be HTML, JSON, binary data, etc.;
③ Parse the content according to its type: regular expressions, a page-parsing library, JSON parsing, or binary handling;
④ Save the data, for example to text files or a database.
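The four steps can be sketched roughly as follows, using only the Python standard library. This is a minimal illustration, not a complete crawler: the function names (`build_request`, `fetch`, `parse_title`, `save_text`, `crawl`), the User-Agent string, and the choice of extracting the page title are all assumptions made for the example.

```python
import re
import urllib.request


def build_request(url, method="GET"):
    # Step 1: a request is a method plus headers (key-value pairs).
    # The User-Agent value below is only a placeholder for illustration.
    headers = {"User-Agent": "Mozilla/5.0 (example crawler)"}
    return urllib.request.Request(url, headers=headers, method=method)


def fetch(request):
    # Step 2: the response carries a status code, response headers and a body.
    with urllib.request.urlopen(request, timeout=10) as response:
        body = response.read()  # raw bytes: may be HTML, JSON or binary data
        return response.status, dict(response.headers.items()), body


def parse_title(html):
    # Step 3: parse the content; here a regular expression extracts <title>.
    match = re.search(r"<title[^>]*>(.*?)</title>", html, re.IGNORECASE | re.DOTALL)
    return match.group(1).strip() if match else None


def save_text(path, text):
    # Step 4: save the data; a text file here, a database would also work.
    with open(path, "w", encoding="utf-8") as f:
        f.write(text)


def crawl(url, path):
    # Chain the four steps for a single HTTP(S) page.
    status, _headers, body = fetch(build_request(url))
    if status != 200:
        return None
    title = parse_title(body.decode("utf-8", errors="replace"))
    if title is not None:
        save_text(path, title)
    return title
```

Calling `crawl("http://example.com/", "title.txt")` would request the page and, if the status code is 200 and a title is found, write it to `title.txt`. For a JSON response, step ③ would use `json.loads` instead of a regular expression.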
3、