[LeetCode in Python] 692 (M) top k frequent words 前K个高频单词 - 代码天地

[LeetCode in Python] 692 (M) top k frequent words 前K个高频单词

其他 2020-03-22 07:17:30 阅读次数: 0

题目：

https://leetcode-cn.com/problems/top-k-frequent-words/

给一非空的单词列表，返回前 k 个出现次数最多的单词。
返回的答案应该按单词出现频率由高到低排序。如果不同的单词有相同出现频率，按字母顺序排序。

示例 1：

输入: ["i", "love", "leetcode", "i", "love", "coding"], k = 2
输出: ["i", "love"]
解析: "i" 和 "love" 为出现次数最多的两个单词，均为2次。
注意，按字母顺序 "i" 在 "love" 之前。

示例 2：

输入: ["the", "day", "is", "sunny", "the", "the", "the", "sunny", "is", "is"], k = 4
输出: ["the", "is", "sunny", "day"]
解析: "the", "is", "sunny" 和 "day" 是出现次数最多的四个单词，
出现次数依次为 4, 3, 2 和 1 次。

注意：

假定 k 总为有效值， 1 ≤ k ≤ 集合元素数。
输入的单词均由小写字母组成。

解题思路

python自带最小堆的实现heapq
heapq有取top k的函数heapq.nlargest(n, iterable[, key]))
上面函数的第三个参数支持多参数级联比较
直接使用nlargest()无法同时满足频率降序和名称升序
技巧是将频率前加-号，然后转为使用nsmallest()

代码

class Solution:
    def topKFrequent(self, words: List[str], k: int) -> List[str]:
        # - statistic word frequency
        freq_dict = {}
        for w in words:
            if w not in freq_dict:
                freq_dict[w] = 0
            freq_dict[w] += 1

        # - top k, sort by -freq and word
        return heapq.nsmallest(k, freq_dict, key=lambda w:(-freq_dict[w], w))

注意

使用heapq属于投机取巧，严格来讲，需要自己实现nsmallest()才能达到考察目的
更通用的做法，是参考quicksort的partition步骤，来实现top k的排序

猜你喜欢

转载自www.cnblogs.com/journeyonmyway/p/12543887.html

[LeetCode in Python] 692 (M) top k frequent words 前K个高频单词

Leetcode-692 Top K Frequent Words(前K个高频单词)

[Swift]LeetCode692. 前K个高频单词 | Top K Frequent Words

LeetCode 692. Top K Frequent Words 前K个高频单词 (Java)

LeetCode 692. Top K Frequent Words

[leetcode]692. Top K Frequent Words

[LeetCode] 692. Top K Frequent Words

[leetcode] 692. Top K Frequent Words @ python

leetcode 692. Top K Frequent Words（python）

692 Top K Frequent Words

leetcode 692. Top K Frequent Words 题解

#Leetcode# 692. Top K Frequent Words

692. Top K Frequent Words

692. Top K Frequent Words - Medium

[LC] 692. Top K Frequent Words

[leetcode]692. Top K Frequent Words K个最常见单词

LeetCode - Top K Frequent Words

LeetCode Top K Frequent Words

[LeetCode in Python] 347 (M) top k frequent elements 前 K 个高频元素

347. Top K Frequent Elements/692. Top K Frequent Words

Top K Frequent Words

Top K Frequent Words（C++前K个高频单词）

LeetCode：前K个高频单词【692】

LeetCode 692 前K个高频单词

LeetCode | 0347. Top K Frequent Elements前 K 个高频元素【Python】

leetcode692. 前K个高频单词

leetcode 692. 前K个高频单词

LeetCode 692. 前K个高频单词（优先队列）

LeetCode#692-前k个高频单词

【LeetCode】692. 前K个高频单词

今日推荐

NetBSD 禁止提交由 AI 生成的代码

Apache Doris 2.0.10 版本正式发布！

开源日报 | 大模型开战；大模型独角兽被曝卖身；周鸿祎建议谷歌开源所有产品；最大开源AI社区提供1000万美元共享GPU

开源日报 | Chrome内置Gemini的意义不在于Gemini；中国AI追随之路的五大误区；ECharts创始人“下海”养鱼；谷歌I/O开发者大会什么都有，只是没有惊喜

微软回应中国区AI团队“打包赴美”传闻

基于大语言模型的开源知识库问答系统 MaxKB GitHub Star 数量突破 5,000 个！

周排行

女程序员是这样被恶搞的

B/S 和 C/S 的优缺点

vector一直申请会怎样？

座头鲸识别比赛(Humpback Whale Identification)总结

Linux高性能服务器编程——I/O复用 select

Mysql连接数据库（当包使用）

通过URI获取的文件路径为null的解决方法

1022-Primes on Interval(素数筛选+二分查找) ZCMU

Python出现： TypeError: expected string or buffer

bzoj2434: [Noi2011]阿狸的打字机 ac自动机+树状数组

每日归档

更多

2024-05-18(4)

2024-05-17(34)

2024-05-16(6)

2024-05-15(24)

2024-05-14(0)

2024-05-13(18)

2024-05-12(0)

2024-05-11(38)

2024-05-10(38)

2024-05-09(35)