使用Matplotlib、jiaba、wordcloud等库函数
从数据库中获取数据,以str形式写入列表(转换数据库中的数据type),然后将列表转换为一个字符串,使用jieba进行分词统计,统计后是一个长文本字符串。
Information_news = Information.objects.all()
Information_news_list = []
for each_I in Information_news:
Information_news_list.append(str(each_I))
str_ = "".join(Information_news_list)
Information_news_text = ' '.join(jieba.cut(str_))
使用wordcloud生成词云图片并保存,其中msyh.ttc是微软雅黑字体,中文显示问题可查看我
之前的文章:
path = 'msyh.ttc'
wordcloud = WordCloud(font_path=path).generate(Information_news_text)
plt.imshow(wordcloud,interpolation='bilinear')
plt.axis("off")
plt.savefig("C:/website/static/figure/Information.png")
结果如图:
完整代码如下(tool文件夹下的views.py):
from django.shortcuts import render,render_to_response
# Create your views here.
from django.http import HttpResponse
from tool.models import *
from wordcloud import WordCloud
import matplotlib.pyplot as plt
import jieba
def plot(request):
Information_news = Information.objects.all()
Information_news_list = []
for each_I in Information_news:
Information_news_list.append(str(each_I))
str_ = "".join(Information_news_list)
Information_news_text = ' '.join(jieba.cut(str_))
path = 'msyh.ttc'
wordcloud = WordCloud(font_path=path).generate(Information_news_text)
plt.imshow(wordcloud,interpolation='bilinear')
plt.axis("off")
plt.savefig("C:/website/static/figure/Information.png")
return render(request,'news_plot.html')