Python: "utf-8" decode error when reading a CSV file

Copyright notice: this is an original article by the blog author and may not be reproduced without the author's permission. https://blog.csdn.net/paulkg12/article/details/80290753

While parsing a CSV file, I got this error:

'utf-8' codec can't decode byte 0xff in position 0

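This means the CSV file is not valid UTF-8: the byte 0xff never occurs in UTF-8 text, and a leading 0xff usually indicates a UTF-16 byte order mark. Convert the file to UTF-8 (or pass its actual encoding to open()) and Python will be able to read it.

Alternatively, read the file in binary mode as described at the link below. Note that the result is then a bytes object rather than a str, so string operations such as strip() and split() will not behave like their str counterparts until you decode it.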
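As a rough illustration of the first option, the sketch below checks the first bytes of the file for a byte order mark and then opens the CSV in text mode with the matching encoding. The helper names `detect_bom_encoding` and `read_csv_rows` are hypothetical, and the UTF-8 fallback for files without a BOM is an assumption; a file in some other legacy encoding would still need its encoding supplied by hand.

```python
import codecs
import csv


def detect_bom_encoding(path, default="utf-8"):
    """Guess a text encoding from the file's byte order mark (hypothetical helper)."""
    with open(path, "rb") as f:
        head = f.read(4)
    # Check UTF-32 first: the UTF-32 LE BOM begins with the UTF-16 LE BOM.
    if head.startswith((codecs.BOM_UTF32_LE, codecs.BOM_UTF32_BE)):
        return "utf-32"
    if head.startswith((codecs.BOM_UTF16_LE, codecs.BOM_UTF16_BE)):
        return "utf-16"
    if head.startswith(codecs.BOM_UTF8):
        return "utf-8-sig"
    # Assumption: no BOM means the file is UTF-8.
    return default


def read_csv_rows(path):
    """Read all rows of a CSV file as lists of str, using the detected encoding."""
    encoding = detect_bom_encoding(path)
    # newline="" is what the csv module recommends for files passed to csv.reader.
    with open(path, newline="", encoding=encoding) as f:
        return list(csv.reader(f))
```

Because the file is opened in text mode, every field comes back as a str, so strip() and split() work as usual.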

https://stackoverflow.com/questions/42339876/error-unicodedecodeerror-utf-8-codec-cant-decode-byte-0xff-in-position-0-in

Original post:

Incorrect:

 image_data = tf.gfile.FastGFile(image, 'r').read()

Fix:

```
# Read the image in binary mode
image_data = tf.gfile.FastGFile(image, 'rb').read()
```


Reference:
https://stackoverflow.com/questions/42339876/error-unicodedecodeerror-utf-8-codec-cant-decode-byte-0xff-in-position-0-in
Python tries to convert a byte-array (a bytes which it assumes to be a utf-8-encoded string) to a unicode string (str). This process of course is a decoding according to utf-8 rules. When it tries this, it encounters a byte sequence which is not allowed in utf-8-encoded strings (namely this 0xff at position 0).

Since you did not provide any code we could look at, we only could guess on the rest.

From the stack trace we can assume that the triggering action was the reading from a file (contents = open(path).read()). I propose to recode this in a fashion like this:

```
with open(path, 'rb') as f:
    contents = f.read()
```
That b in the format specifier in the open() states that the file shall be treated as binary, so contents will remain a bytes. No decoding attempt will happen this way.
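Reading in binary mode only postpones the decoding step. If text operations are needed afterwards, the bytes can be decoded explicitly once the real encoding is known. The sketch below assumes the file is UTF-16, which is a common cause of a leading 0xff but is not confirmed by the original post; `read_and_decode` and `split_fields` are hypothetical helper names.

```python
def read_and_decode(path, encoding="utf-16"):
    """Read raw bytes, then decode them with an explicitly chosen encoding."""
    # Binary mode: no decoding is attempted, so no UnicodeDecodeError here.
    with open(path, "rb") as f:
        contents = f.read()
    # Assumption: the caller knows the encoding; "utf-16" is only a default guess.
    return contents.decode(encoding)


def split_fields(text, delimiter=","):
    """Split decoded text into rows of fields using ordinary str methods."""
    return [
        line.strip().split(delimiter)
        for line in text.splitlines()
        if line.strip()
    ]
```

For real CSV data with quoted fields, the csv module is a safer choice than a manual split(); the split here only shows that str methods become available after decoding.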
