对字符串列表进行数据频度统计,并且进行排名,选出前几名
import numpy as np
li = ['cc','aa','cc','bb','aa','cc','cc','aa','cc','##','##','##']
uni = np.unique(li) # 唯一值
dic = dict((a, li.count(a)) for a in uni) # 以字典格式来显示数据频度
print(dic.items())
结果:
dict_items([('##', 3), ('aa', 3), ('bb', 1), ('cc', 5)])
dic = sorted(dic.items(), key=itemgetter(1), reverse=True) # True--降序
print(dic)
print(list(x[0] for x in dic[:2])) # 选出结果前两名的字符串
结果:
[('cc', 5), ('##', 3), ('aa', 3), ('bb', 1)]
['cc', '##']
sorted()
对所有可迭代的对象进行排序操作
每篇小附录:
Adobe下载和安装的错误代码和解决方案