Data cleansing: obtaining address information from an organization name

1. My first thought was to crawl Baidu Encyclopedia for the relevant information. The problem is that Baidu Encyclopedia only covers some large organizations, such as universities and research institutes, so some addresses could not be obtained this way.

2. Next I thought of using scrapy + selenium to crawl address information from Baidu Maps, but there were still some addresses that could not be crawled.

3. I then learned that the Baidu Maps API can return this information. I have started studying it and will add the details tomorrow.
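
Until those details are written up, here is a rough sketch of how the API route in step 3 might look. It is not the code from this post. It assumes the Baidu Maps place search endpoint `https://api.map.baidu.com/place/v2/search` with the parameters `query`, `region`, `output` and `ak`, and a JSON reply with a `status` field and a `results` list whose items carry an `address`; all of that should be checked against the official documentation. The helper names `build_search_url`, `extract_address` and `lookup_address` are made up for illustration.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: endpoint of the Baidu Maps place search service.
# Verify the path and parameter names in the official documentation.
PLACE_SEARCH_URL = "https://api.map.baidu.com/place/v2/search"


def build_search_url(unit_name, region, ak):
    """Build the request URL for one organization name (hypothetical helper)."""
    params = {
        "query": unit_name,   # the organization name to look up
        "region": region,     # city or province to search in
        "output": "json",
        "ak": ak,             # your own API key
    }
    return PLACE_SEARCH_URL + "?" + urlencode(params)


def extract_address(payload):
    """Return the address of the first hit that has one, else None.

    Assumes the reply looks like {"status": 0, "results": [{"address": ...}]}.
    """
    if payload.get("status") != 0:
        return None
    for hit in payload.get("results") or []:
        address = hit.get("address")
        if address:
            return address
    return None


def lookup_address(unit_name, region, ak, timeout=10):
    """Query the service once and return an address string or None."""
    url = build_search_url(unit_name, region, ak)
    with urlopen(url, timeout=timeout) as resp:
        payload = json.loads(resp.read().decode("utf-8"))
    return extract_address(payload)
```

Names for which `lookup_address` returns `None` could then be set aside for manual checking or for one of the crawling approaches in steps 1 and 2.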

Origin www.cnblogs.com/KYin/p/12484024.html