Use Python to crawl website data analysis

I’ve been playing Chat for a while, and I’ll look back at it from data crawling (how to crawl web pages with Python to make e-books), front-end and back-end (using Kotlin to develop SpringBoot’s Data JPA, and Angular2+ to develop Markdown editors), to development Languages ​​(TypeScript Quick Start) are covered. But when we share a Chat, have we thought about:

  1. What kind of chat is the most popular?
  2. Which type of Chat has the most authors?
  3. Who is the author who posts the most Chat?
  4. Who are the highest paid authors?
  5. What are the most popular topics?
  6. ……

Now that deep learning has made remarkable progress, the data on the Internet is like a huge gold mine. I can't tell where there is gold, but I know where there is a shovel. Today, we learn to use Selenium to grab page data, save it to MongoDB, and then use PyNum, MatplotLib, Pandas and other tools to analyze, process, and display the data, and try to solve our above questions.
Selenium_Chat.jpg

chat_member.jpg

[Read the original text] ( http://blog.techcave.cn/2018/04/04/it/chat/%E4%BD%BF%E7%94%A8Python%E7%88%AC%E5%8F%96% E7%BD%91%E7%AB%99%E6%95%B0%E6%8D%AE%E5%88%86%E6%9E%90/)

Guess you like

Origin http://43.154.161.224:23101/article/api/json?id=326159130&siteId=291194637