Beautiful Soup, a Tool for Extracting Web Page Content

Copyright notice: This is an original article by the blogger and may not be reposted without the blogger's permission. https://blog.csdn.net/hellonlp/article/details/74556091

What is Beautiful Soup?

Beautiful Soup is a Python library for pulling data out of HTML and XML files. It works with your favorite parser to provide idiomatic ways of navigating, searching, and modifying the parse tree. It commonly saves programmers hours or days of work.
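
To make the description above concrete, here is a minimal sketch of searching and navigating a parse tree to pull the main text out of a page. The sample HTML, the `content` div id, and the helper name `extract_main_text` are all hypothetical and chosen only for illustration; real pages will need their own selectors. It assumes the `bs4` package is installed and uses Python's built-in `html.parser` as the parser.

```python
from bs4 import BeautifulSoup

# Hypothetical sample page, made up for this illustration.
SAMPLE_HTML = """
<html>
  <head><title>Sample page</title></head>
  <body>
    <div id="nav"><a href="/home">Home</a> <a href="/about">About</a></div>
    <div id="content">
      <h1>Hello, soup</h1>
      <p>First paragraph.</p>
      <p>Second paragraph with a <a href="https://example.com/more">link</a>.</p>
    </div>
  </body>
</html>
"""


def extract_main_text(html):
    """Return the title, heading, paragraphs, and links of the main content div.

    Assumes the page keeps its body text in <div id="content">, which is
    an assumption of this sketch and not a general rule.
    """
    # Parse with the standard-library parser so no extra parser is required.
    soup = BeautifulSoup(html, "html.parser")

    # Search: locate the container that holds the main text.
    content = soup.find("div", id="content")

    # Collapse runs of whitespace so each paragraph is a single clean line.
    paragraphs = [" ".join(p.get_text().split()) for p in content.find_all("p")]

    return {
        # Navigate: attribute access walks down the tree by tag name.
        "title": soup.title.string,
        "heading": content.h1.get_text(strip=True),
        "paragraphs": paragraphs,
        # Only links inside the content div; navigation links are ignored.
        "links": [a["href"] for a in content.find_all("a", href=True)],
    }


result = extract_main_text(SAMPLE_HTML)
for paragraph in result["paragraphs"]:
    print(paragraph)
```

Restricting the search to one container is a simple way to skip navigation bars and similar page chrome; it only works when the page marks its main content with a stable id or class.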


Chinese documentation: https://www.crummy.com/software/BeautifulSoup/bs4/doc/index.zh.html

English documentation: https://www.crummy.com/software/BeautifulSoup/bs4/doc/

Beautiful Soup usage tutorial: http://wiki.jikexueyuan.com/project/python-crawler-guide/beautiful-soup.html

Reposted from blog.csdn.net/hellonlp/article/details/74556091