LZ78编码算法 - 代码天地

LZ78编码算法

其他 2019-01-19 08:50:59 阅读次数: 0

LZ78算法的压缩过程非常简单。在压缩时维护一个动态词典Dictionary，其包括了历史字符串的index与内容；压缩情况分为三种：

若当前字符c未出现在词典中，则编码为(0, c)；
若当前字符c出现在词典中，则与词典做最长匹配，然后编码为(prefixIndex,lastChar)，其中，prefixIndex为最长匹配的前缀字符串，lastChar为最长匹配后的第一个字符；
为对最后一个字符的特殊处理，编码为(prefixIndex,)。

如果对于上述压缩的过程稍感费解，下面给出三个例子。例子一，对于字符串“ABBCBCABABCAABCAAB”压缩编码过程如下：

1. A is not in the Dictionary; insert it

2. B is not in the Dictionary; insert it

3. B is in the Dictionary. BC is not in the Dictionary; insert it.

4. B is in the Dictionary. BC is in the Dictionary. BCA is not in the Dictionary; insert it.

5. B is in the Dictionary. BA is not in the Dictionary; insert it.

6. B is in the Dictionary. BC is in the Dictionary. BCA is in the Dictionary. BCAA is not in the Dictionary; insert it.

7. B is in the Dictionary. BC is in the Dictionary. BCA is in the Dictionary. BCAA is in the Dictionary. BCAAB is not in the Dictionary; insert it.

例子二，对于字符串“BABAABRRRA”压缩编码过程如下：

1. B is not in the Dictionary; insert it

2. A is not in the Dictionary; insert it

3. B is in the Dictionary. BA is not in the Dictionary; insert it.

4. A is in the Dictionary. AB is not in the Dictionary; insert it.

5. R is not in the Dictionary; insert it.

6. R is in the Dictionary. RR is not in the Dictionary; insert it.

7. A is in the Dictionary and it is the last input character; output a pair containing its index: (2, )

如何进行解压：

解压缩能更根据压缩编码恢复出（压缩时的）动态词典，然后根据index拼接成解码后的字符串。为了便于理解，我们拿上述例子一中的压缩编码序列(0, A) (0, B) (2, C) (3, A) (2, A) (4, A) (6, B)来分解解压缩步骤，如下图所示：

前后拼接后，解压缩出来的字符串为“ABBCBCABABCAABCAAB”。

猜你喜欢

转载自blog.csdn.net/qq_41989372/article/details/84921845

LZ78编码算法

LZ78编码Java实现

【数据压缩】LZ78算法原理及实现

LZ4编码

nginx使用gzip压缩文件---lz77算法---Haffman编码

LZ4压缩算法

图解LZ77压缩算法

【压缩算法之LZ4】

LZ4----优秀的压缩算法

LZ4压缩算法分析

LZ系类经典压缩算法介绍

简单实现LZ77压缩算法

GZIP中的LZ77压缩算法

数据压缩算法---LZ77算法的分析与实现

LZ4压缩算法（库+头文件+范例）

速度之王 — LZ4压缩算法（一）

WOW! LZ4, 超越Snappy的压缩算法

LZ4算法实现对文件目录的压缩

lz4 - Extremely fast compression压缩算法

golang常用库之- pierrec/lz4包 | lz4命令、lz4压缩算法(高压解速度)

78

78！

项目实战——基于LZ77变形和哈夫曼编码的GZIP压缩

LeetCode算法系列：78. Subsets

LeetCode算法题78：子集解析

dskfmkkfhm;lz

基于Huffman算法和LZ77算法的文件压缩（八）

基于Huffman算法和LZ77算法的文件压缩（七）

基于Huffman算法和LZ77算法的文件压缩（六）

基于Huffman算法和LZ77算法的文件压缩（五）

今日推荐

openKylin 社区生态委员会第六次会议圆满召开

阿里云正式发布通义千问 2.5

Python 3.13 发布首个 Beta：实验性自由线程模式和 JIT、改进交互式解释器

Stack Overflow 拿我的代码去训练 AI 大模型，还封了我的账号

Pop!_OS 的 COSMIC 桌面完成 App Store 上架工作

报告：Django 仍然是 74% 开发者的首选

《2024 年一季度互联网投融资运行情况》研究报告

15 年前上了“FFmpeg 耻辱柱”，今天他还得谢谢咱——腾讯QQPlayer一雪前耻？

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

GCC 14.1 发布

面壁智能发布 Eurux-8x22B 开源大模型 —— 堪称「理科状元」

开源日报 | 谷歌扶持鸿蒙上位；开源Rabbit R1；Docker加持的安卓手机；微软的焦虑和野心；海尔电器把开放平台关了

周排行

计算机组成与设计（七）—— 除法器

Integer Approximation(分治+枚举)

大话数据库索引

windows10系统JDK的配置及下载地址

mysql实现秒值转换中原六仔平台搭建

Codeforces Round #556 (Div. 1)

百练1064 网线主管

Codeforces 995F Cowmpany Cowmpensation

子集生成之增量构造法，位向量法，二进制法

ERROR: cmd.exe failed with args /c "/APK\gradle\rungradle.bat...

每日归档

更多

2024-05-10(38)

2024-05-09(35)

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)