python中的中文字符处理decode和encode - 代码天地

python中的中文字符处理decode和encode

其他 2018-12-06 10:41:15 阅读次数: 0

摘抄:

字符串在Python内部的表示是Unicode编码,因此,在做编码转换时,通常需要以unicode作为中间编码,即先将其他编码的字符解码(decode)成unicode,再从unicode编码(encode)成另一种编码。

decode的作用是将其他编码的字符转换成unicode编码,如str1,decode('gb2312'),表示将gb2312编码的字符串str1转换成unicode编码。

encode的作用是将unicode编码转换成其他编码的字符串,如str2,encode('gb2312'),表示将unicode编码的字符串str2转换成gb2312编码。

因此,转码的时候一定要明白,字符串str是什么编码,然后decode成unicode编码,然后再encode成其他编码。

import zipfile
azip=zipfile.ZipFile("C:\\Users\\160505\\Downloads\\商品导出_20181127_1359.zip")
Traceback (most recent call last):
  File "<input>", line 1, in <module>
  File "c:\python27\Lib\zipfile.py", line 756, in __init__
    self.fp = open(file, modeDict[mode])
IOError: [Errno 2] No such file or directory: 'C:\\Users\\160505\\Downloads\\\xe5\x95\x86\xe5\x93\x81\xe5\xaf\xbc\xe5\x87

azip=zipfile.ZipFile(u"C:\\Users\\160505\\Downloads\\商品导出_20181127_1359.zip")
for file in azip.namelist():
    print file
    
12.xlsx
��ͨ��Ʒ.xlsx
��װ��Ʒ.xlsx

for file in azip.namelist():
    print type(file)    
    print type(file.decode('gbk'))
    print type(file.decode('gbk').encode('utf-8'))
    
<type 'unicode'>
<type 'unicode'>
<type 'str'>
<type 'str'>
<type 'unicode'>
<type 'str'>
<type 'str'>

总结：中文的处理，先转化为unicode（decode编码），再进行解码（encode），基本都可以这么操作

猜你喜欢

转载自blog.csdn.net/zhouxuan623/article/details/84584389

python中的中文字符处理decode和encode

python中字符串的encode和decode

python字符串 decode 和 encode

Python: 在CSV文件中写入中文字符

python提取url中的所有中文字符

Python 3中使用中文字符报错

Python中怎么识别中文字符？

python 花式bar绘图和中文字符显示

url 中文字符处理

lua string 处理中文字符

C++对中文字符的处理

python—获取字符串格式的序列的中文字符，判别和提取中文字符的方法

python中decode与encode

python中的encode（）和decode（）的用法

python中decode和encode的区别

python中的encode()和decode()函数

python中的decode（编码）和encode（解码）

python中的encode（）和decode（）函数

python中decode和encode区别

python中decode()和encode()的使用

python中的编码以及解码问题（中文字符处理以及文件处理的某些注意事项）

python decode和encode

python decode()和encode()

python 允许出现中文字符

python 正则匹配中文字符

python统计中文字符数量

python 中文字符操作

opencv python中文字符显示

python去除中文字符

Python 打印中文字符

今日推荐

面壁智能发布 Eurux-8x22B 开源大模型 —— 堪称「理科状元」

开源日报 | 谷歌扶持鸿蒙上位；开源Rabbit R1；Docker加持的安卓手机；微软的焦虑和野心；海尔电器把开放平台关了

中国码农的“35岁魔咒”

蘭雅 CorelDRAW 插件 2024.5.1 国际劳动节版，免费下载

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

周排行

【转】spring中对控制反转和依赖注入的理解

tms webcore 安装和使用

java程序员进阶相关书籍

SpringMVC接受请求参数、

如何保存训练好的机器学习模型

MyEclipse、Eclipse设置项目JDK的三个地方

商超行业微信小程序开发定制一般多少钱（行业技术人员解读）

Markdown编辑器语言——30分钟入门到到精通

Linux系统下MongoDB的简单安装与基本操作

Power Strings

每日归档

更多

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)