python第四次作业 - 代码天地

python第四次作业

其他 2018-10-15 12:00:22 阅读次数: 0






q = open('遇见.txt', 'r', encoding='utf-8').read()
wordsls = jieba.lcut(q)
wcdict = {}
for word in wordsls:
    if len(word) == 1:
        continue
    else:
        wcdict[word] = wcdict.get(word, 0) + 1

wcls = list(wcdict.items())
wcls.sort(key=lambda x: x[1], reverse=True)
print(wcls)
for i in range(7):
    print(wcls[i])

　　

#准备utf-8编码的文本文件，通过文件读取字符串str
fo=open('because of you.txt','r',encoding='utf-8')
stra = fo.read().lower()
fo.close()
print(stra)

sep=',.;!'
for ch in sep:
    stra = stra.replace(ch,'')#进行预处理，清除掉sep中存在的标点符号
print(stra)

strList=stra.split('')
print(len(strList),strList)#分解提取单词，转化为列表list

strSet = set(strList)
print(len(strSet),strSet)#转化为集合

strDict={}
for world in strSet:
    strDict[world] = strList.count(world)
print(len(strDict),strDict)#转化为字典，计算上一个集合中每个单词出现的次数

wcList=list(strDict.items())
print(wcList)#将字典中的目录转化为列表输出
wcList.sort(key=lambda x:x[1],reverse= True)
print(wcList) #按降序输出

e = {'a','the','an','and','i','or','of'}
strSet = strSet - e
print(len(strSet),strSet) #排除语法型词汇，代词、冠词、连词等无语义词

for i in range(20):
    print(wcList[i]) #TOP20输出

　　

猜你喜欢

转载自www.cnblogs.com/asyxhs/p/9790103.html

python第四次作业

第四次python作业

【python】第四次课作业

python第四次作业——曾景

Python第四次作业-----宋舒婷

python第四次作业——陈灵院

Python 第四次作业叶炜

2018第四次作业

第四次作业树

第四次作业（2）

2018第四次作业。

第四次PTA作业

第四次作业

第四次作业-树

c第四次作业

第四次作业——树

PTA第四次作业

第四次作业--树

第四次·作业

第四次团队作业

团队作业第四次

团队第四次作业

图论第四次作业

第四次博客作业

oo第四次作业

0706 第四次作业

第四次作业-----------数组

第四次作业——数组

第四次作业----数组

~第四次作业~

今日推荐

LFOSSA 源来如此公开课 | 掌握云原生未来：CNCF 认证全面攻略与备考秘籍

国产云输入法——仅华为无云端数据上传安全问题

开源日报 | 工业开源项目OGG 1.0；姐姐，你要和我一起配置火狐吗；苹果AI遥遥落后？Fedora 40

开放签电子签章：停止新增，优化体验，前进更进（五一假期前工作）

开源日报 | 中学生开源前端动画引擎；全球首个Llama3 8B中文版开源模型；联想电脑恐出局；Linus讽刺AI炒作

“百模大战”必有一战 | 2024中国“百模大战”竞争格局分析

周排行

Family Tree 题解

BZOJ 1093 最大半连通子图 SCC + DP

幂等处理

Spring----学习（2）----XML 配置Bean 自动装配

SQL Server 远程更新目标表数据

HIbernate3.6 环境搭建

特殊符号正则表达式

【Linux】第一章进程的理解

843. n-皇后问题（dfs+输出各种情况）

空间数据库2

每日归档

更多

2024-04-26(39)

2024-04-25(22)

2024-04-24(36)

2024-04-23(26)

2024-04-22(39)

2024-04-21(0)

2024-04-20(6)

2024-04-19(5)

2024-04-18(0)

2024-04-17(5)