python 中文乱码解决方案 - 代码天地

python 中文乱码解决方案

其他 2020-06-24 17:04:44 阅读次数: 0

python 处理文字内容时，常常遇到编码的问题。

汉字常用的两种编码方式为 utf8 和 gbk，解析一个 txt 文件或者一个字符串时经常会遇到编码问题。

python 处理文字内容时，常常遇到编码的问题。

汉字常用的两种编码方式为 utf8 和 gbk，解析一个 txt 文件或者一个字符串时经常会遇到编码问题。

def force_decode(string:bytes) ->str:
    """
    sometimes neither gbk nor gbk can decode succseefully from string
    select longger decode result from utf8 or gbk
    """
    if not isinstance(string, bytes):
        raise ValueError('expected bytes array')
    decode_chars_count = []
    for i in ['utf8', 'gbk']:
        try:
            return string.decode(i)
        except UnicodeDecodeError as ex:
            decode_chars_count.append(ex.start)
    # neither utf8 or gbk decode successfully
    # select the longer decode one
    utf8_len, gbk_len = decode_chars_count
    selected_encoding = 'utf8' if utf8_len > gbk_len else 'gbk'
    return string.decode(selected_encoding, errors='ignore')

代码链接：https://gist.github.com/albertofwb/b53bf32adca5c245c6dee6642ca5463d

猜你喜欢

转载自www.cnblogs.com/albertofwb/p/13188372.html

python 中文乱码解决方案

python写入csv文件中文乱码解决方案

Python HTTP库requests中文页面乱码解决方案！

linux下python中文乱码解决方案

python对打印出中文乱码问题的解决方案

【Python】中文乱码问题与解决方案深入分析

Python HTTP库requests中文页面乱码解决方案 Python HTTP库requests中文页面乱码解决方案！

python爬取网页中文乱码。解决方案。python3

python3 java调用python出现中文乱码解决方案

mac下python matplotlib中文乱码解决方案（亲测可用）！！

python3.x+requests 爬取网站遇到中文乱码的解决方案

Python requests包get响应内容中文乱码解决方案

Python------工具之sublimeText3控制台输出中文乱码的解决方案

python编程中中文输出乱码UnicodeEncodeError: 'ascii' codec can't encode character解决方案

Python - Sublime Text 3 控制台输出中文乱码的解决方案

Python中requests.get响应内容中文乱码解决方案

Python3数据插MySQL中文乱码解决方案

python发送邮件，中文附件下载乱码问题解决方案

Python使用pylab绘制图像中文乱码问题解决方案

python输出HTMLTestRunner的html测试报告，中文乱码解决方案并上例子说明

python requests乱码解决方案

VS中使用Python2.x编译器打印中文出现乱码的解决方案

python读取us7ascii字符集Oracle数据库中文乱码的解决方案

Python2操作JSON出现乱码的解决方案

中文乱码及解决方案

Python文件处理os模块介绍（os.system运行shell命令中文乱码解决方案） -*- Python基础知识12 -*-

Python Matplot中文显示完美解决方案

Python中的中文编码问题及解决方案

解决中文乱码的几种解决方案

python解决Requests中文乱码

今日推荐

基于大语言模型的开源知识库问答系统 MaxKB GitHub Star 数量突破 5,000 个！

美国拟限制 AI 大模型出口中国和俄罗斯

苹果将与 OpenAI 达成协议，将 ChatGPT 应用于 iPhone

openKylin 社区生态委员会第六次会议圆满召开

阿里云正式发布通义千问 2.5

Python 3.13 发布首个 Beta：实验性自由线程模式和 JIT、改进交互式解释器

Stack Overflow 拿我的代码去训练 AI 大模型，还封了我的账号

Pop!_OS 的 COSMIC 桌面完成 App Store 上架工作

《2024 年一季度互联网投融资运行情况》研究报告

报告：Django 仍然是 74% 开发者的首选

15 年前上了“FFmpeg 耻辱柱”，今天他还得谢谢咱——腾讯QQPlayer一雪前耻？

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

周排行

记一下去大梅沙的准备（2018-05-26）

Spring 注解事务

基于HTTP协议的客户端缓存

阿里云rds 备份和还原

[PHP] 几个拖慢 PHP 程序/API 运行速度的点

python 代码风格------------PEP8规则

js控制json生成菜单——自制菜单（一）

将字符串: 'k:1|k1:2|k2:3|k3:4 ' ,处理成 python 字典: {'k':1, 'k1':2, ...}

微信小程序转支付宝小程序

Qt551.窗口滚动条

每日归档

更多

2024-05-13(18)

2024-05-12(0)

2024-05-11(38)

2024-05-10(38)

2024-05-09(35)

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)