numpy加载包含中文的csv文件报错的解决方案

版权声明:本文为博主原创文章,未经博主允许不得转载。 https://blog.csdn.net/Kwoky/article/details/83997879

import numpy as np

filename = './data/51job.csv'
arr = np.loadtxt(filename,
                 delimiter=',',
                 skiprows=1,
                 usecols=[0,1,2,3],
                 dtype=str,
                 )

print(arr)

用numpy加载包含中文的csv文件会报错:

UnicodeDecodeError: 'gbk' codec can't decode byte 0x80 in position 36: illegal multibyte sequence

解决办法一:

arr = np.loadtxt(filename,
                 delimiter=',',
                 skiprows=1,
                 usecols=[0,1,2,3],
                 dtype=str,
                 encoding='utf8'
                 )

print(arr)

解决办法二:

arr=np.loadtxt(open(filename, encoding='utf8'),
               dtype=str,
               delimiter=',',
               skiprows=1)
print(arr)

猜你喜欢

转载自blog.csdn.net/Kwoky/article/details/83997879