python爬取微博热题-官宣赵丽颖并做词云分析

        来来来,看一下今天的头条新闻,接下来我们来python以下到底发生了什么情况,毕竟我虽然不关注这些,但是一早上被很多人通知,并且报以不友好的评价。通过python去爬该条微博的评论,并做分词,且做词云分析:

###部分代码


import requests,json,re


def get_comment():
    headers = {
           'Cookies':'_T_WM=cf11b1dea37c9289ed064f8059b35d3d; SCF=Ai8Ki_8XZFpkUIS91rKFntgL-MmbDkbYxignZAs0XqMN-LtWo9nbpVLp-TbiiyWu3WD0hShJb4u33VG5ltISuD4.; SUB=_2A252wQgFDeRhGeBL71EV-C3JzzuIHXVSTahNrDV6PUJbkdBeLWLVkW1NRxtPax6yparr4gNIxExtsJOQF0Y7BP-s; SUHB=079j6k0Kc9u8Zb; SSOLoginState=1539668053; MLOGIN=1; M_WEIBOCN_PARAMS=oid%3D4295689414745032%26lfid%3D4295689414745032%26luicode%3D20000174; WEIBOCN_FROM=1110106030',
           'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/69.0.3497.100 Safari/537.36'
           }
    #url = 'https://m.weibo.cn/api/comments/show?id=4295689414745032&page=1'
    for i in range (0,101):
        url_comment = 'https://m.weibo.cn/api/comments/show?id=4295689414745032&page=%d'%(i)
        wb_data = requests.get(url_comment,headers=headers).text
        data_comment = json.loads(wb_data)
       # print(data_comment)
        data = data_comment['data']
        for a in data['data']:
            print(re.sub('<[^>]*>', '',a['text']))


def main():
    get_comment()


if __name__ == '__main__':
    main()

最后的结果还是很好的

猜你喜欢

转载自blog.csdn.net/OYY_90/article/details/83096593