pandas DataFrame to_excel raises UnicodeDecodeError: 'ascii' codec can't decode byte 0xe7 in position 7

The problem

Python code

import pandas as pd
df = pd.read_csv("test.csv", sep="\t")
df.to_excel("test.xlsx")

Error message

UnicodeDecodeError: 'ascii' codec can't decode byte 0xe7 in position 7: ordinal not in range(128)
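
To see where this message comes from, here is a minimal sketch. It is written for Python 3, where the decode has to be spelled out explicitly; in Python 2 the same decode happens implicitly whenever a byte string is mixed with unicode. The sample string `column_编码` is made up for illustration and is not taken from the original CSV: its first 7 bytes are plain ASCII, and the UTF-8 encoding of 编 starts with byte 0xe7.

```python
# Hypothetical sample value: 7 ASCII characters followed by Chinese text.
raw = "column_编码".encode("utf-8")

try:
    # Decoding UTF-8 bytes with the ascii codec fails on the first byte >= 0x80.
    raw.decode("ascii")
    message = None
except UnicodeDecodeError as exc:
    message = str(exc)

print(message)
```

So the error most likely means that some cell or column name contains UTF-8 encoded non-ASCII text and something along the way tried to decode it as ASCII.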

Solution

Straight to the code (Python 2):

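import sys
reload(sys)  # Python 2 only: setdefaultencoding is removed at startup, reload(sys) brings it back
sys.setdefaultencoding('utf-8')
import pandas as pd

def csv2excel(fp):
    df = pd.read_csv(fp, sep="\t")
    cols = df.columns
    # Convert the encoding of every column
    for e in cols:
        # "utf8" here is the default encoding of the Python runtime, i.e. sys.getdefaultencoding()
        df[e] = df[e].map(lambda x: str(x).decode("utf8").encode("raw_unicode_escape").decode("raw_unicode_escape"))
        print e
    # Write the result next to the input file, e.g. test.csv -> test.xlsx
    df.to_excel(fp.replace(".csv", ".xlsx"))
    print fp

if __name__ == "__main__":
    fp = "test.csv"
    csv2excel(fp)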
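
The per-column conversion above boils down to "turn every cell into a text string, decoding bytes as UTF-8". Here is a rough Python 3 sketch of that idea. The helper name `to_text` and the sample cells are hypothetical, and the sketch leaves out the `raw_unicode_escape` round trip from the original code.

```python
def to_text(value, encoding="utf-8"):
    """Return value as a text string, decoding bytes with the given encoding."""
    if isinstance(value, bytes):
        return value.decode(encoding)
    # Non-bytes values (numbers, existing text) are simply converted with str().
    return str(value)

# Hypothetical cell values: UTF-8 bytes, plain text, and a number.
cells = ["编码".encode("utf-8"), "plain", 42]
converted = [to_text(c) for c in cells]
print(converted)
```

With pandas, a helper like this could presumably be applied per column via `df[e].map(to_text)`, the same way the lambda is used above.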

Reposted from blog.csdn.net/Sinsa110/article/details/78560899