Inventory of a Python web crawler over verification code (method 2)

Click on " Python Crawler and Data Mining " above to pay attention

Reply to " Books " to get a total of 10 e-books on Python from entry to advanced





Lightly gathered, slowly twisted, wiped and picked again, first it was "Neon Clothes" and then "Six Yao".

Hello everyone, I am Pippi.

I. Introduction

A few days ago in the Python strongest king group [鶏 ah 鶏. ] I asked a Pythonquestion about a web crawler, and I will share it with you here.

The previous article has given a method to directly request the url of the image, and then identify it, which is very efficient. In this article, let’s take a look at another method.

2. Implementation process

This method is provided by the fans themselves. On the basis of his original code, he has optimized it. Instead of locating the picture based on the location, it locates the picture according to the label of the picture. The optimized code is as follows:


If you are not familiar with requests and Beautiful, you can consider the solution in the above picture.

In fact, it can be slightly optimized here. For example, the second sleep can be removed. In addition, the screenshot can be directly added after the xpath line of code, and it can be done in one step. Interested friends can try it out.

At present, two methods have been given, and the other method, let’s read the next article together, and use the coding platform to help, so stay tuned!

3. Summary

Hello everyone, I am Pippi. This article mainly takes stock of Pythonthe problem of a web crawler passing verification codes. Aiming at this problem, the article gives specific analysis and code implementation to help fans solve the problem smoothly.

Finally, I would like to thank fans [鶏啊鶏] for asking questions, thank [Classmate Ning], [Brother Wei], [Teacher Yuliang], [Zheng Yuzhe Xiaopang] for their ideas and code analysis, and thank [Ineverleft] and others for participating in the learning exchange.

[Supplementary questions] Warm reminder, when you ask questions in the group. You can pay attention to the following points: if it involves large file data, you can desensitize the data, send some demo data (meaning small files), and then paste some code (the kind that can be copied), and remember to send the screenshot of the error report (complete cut ). If there are not many codes, just send the code text directly. If the code exceeds 50 lines, just send a .py file.


If you have any problems during the learning process, please feel free to contact me to solve them (my WeChat: pdcfighting1). At the request of fans, I have created some high-quality Python paid learning exchange groups and paid order receiving groups. Welcome everyone to join me Python learning exchange group and order receiving group!


Friends, hurry up and practice it! If you encounter any problems during the learning process, please add me as a friend, and I will pull you into the Python learning exchange group to discuss learning together.


------------------- End -------------------

Recommendations for past wonderful articles:


Welcome everyone to like, leave a message, forward, repost, thank you for your company and support

If you want to join the Python learning group, please reply in the background [ join the group ]

Thousands of rivers and thousands of mountains are always in love, can you click [ Looking ]

/Today's Message Topic/

Just say a few words~~

Guess you like