python凤凰新闻数据分析(三)新闻数据可视化——词云

使用Matplotlib、jiaba、wordcloud等库函数

从数据库中获取数据,以str形式写入列表(转换数据库中的数据type),然后将列表转换为一个字符串,使用jieba进行分词统计,统计后是一个长文本字符串。

Information_news = Information.objects.all()
Information_news_list = []
for each_I in Information_news:
         Information_news_list.append(str(each_I))
str_ = "".join(Information_news_list)
Information_news_text = ' '.join(jieba.cut(str_))
使用wordcloud生成词云图片并保存,其中msyh.ttc是微软雅黑字体,中文显示问题可查看我 之前的文章
path = 'msyh.ttc'

wordcloud = WordCloud(font_path=path).generate(Information_news_text)
plt.imshow(wordcloud,interpolation='bilinear')
plt.axis("off")
plt.savefig("C:/website/static/figure/Information.png")

结果如图:


完整代码如下(tool文件夹下的views.py):

from django.shortcuts import render,render_to_response

# Create your views here.
from django.http import HttpResponse
from tool.models import *
from wordcloud import WordCloud
import matplotlib.pyplot as plt
import jieba

def plot(request):
    Information_news = Information.objects.all()
    Information_news_list = []
    for each_I in Information_news:
        Information_news_list.append(str(each_I))
    str_ = "".join(Information_news_list)
    Information_news_text = ' '.join(jieba.cut(str_))

    path = 'msyh.ttc'
    
    wordcloud = WordCloud(font_path=path).generate(Information_news_text)
    plt.imshow(wordcloud,interpolation='bilinear')
    plt.axis("off")
    plt.savefig("C:/website/static/figure/Information.png")

return render(request,'news_plot.html')


猜你喜欢

转载自blog.csdn.net/css_aaa/article/details/80971961