Removing duplicate rows from a pandas DataFrame with drop_duplicates: parameters

Copyright notice: this is the author's original article and may not be reproduced without the author's permission. https://blog.csdn.net/qq_37486501/article/details/86646972

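A DataFrame may contain rows that are duplicated in full, or rows that share the same values in certain columns. In either case the duplicates can be removed with drop_duplicates, for example:
data.drop_duplicates(subset=['A', 'B'], keep='first', inplace=True)

Here subset lists the columns that are compared when looking for duplicates (all columns if omitted), keep='first' retains the first row of each group of duplicates (keep='last' retains the last one, keep=False drops them all), and inplace=True modifies data directly instead of returning a new DataFrame.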
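The effect of each parameter is easiest to see on a small table. The following is a minimal sketch, assuming a made-up DataFrame with columns A, B, and C; the data and variable names are illustrative only and do not come from the original post.

```python
import pandas as pd

# Hypothetical toy data: rows 0 and 1 share the same (A, B) pair.
data = pd.DataFrame({
    "A": [1, 1, 2, 2],
    "B": ["x", "x", "y", "z"],
    "C": [10, 20, 30, 40],
})

# Compare only columns A and B; keep the first row of each duplicate group.
first = data.drop_duplicates(subset=["A", "B"], keep="first")

# Same comparison, but keep the last row of each duplicate group.
last = data.drop_duplicates(subset=["A", "B"], keep="last")

# keep=False drops every row that has a duplicate, keeping none of them.
unique_only = data.drop_duplicates(subset=["A", "B"], keep=False)

# With inplace=True the call modifies data itself and returns None.
result = data.drop_duplicates(subset=["A", "B"], keep="first", inplace=True)

print(first)
print(last)
print(unique_only)
print(result, len(data))
```

Note that the surviving rows keep their original index labels. If a fresh 0..n-1 index is wanted, calling reset_index(drop=True) on the result is one option.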

Example:

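import pandas as pd

# Year, data, and TDRState are assumed to have been defined earlier
s = {"YYYY": Year, "State": data["State"], "TDRState": TDRState}
submit = pd.DataFrame(data=s)
# Drop rows that repeat the same (State, TDRState, YYYY) combination, keeping the first
submit = submit.drop_duplicates(subset=['State', 'TDRState', 'YYYY'], keep='first', inplace=False)
# Save to CSV without the index column
submit.to_csv('/Users/liyixin/Desktop/result.csv', index=False)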
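
The example above depends on variables that the post does not show. The following is a self-contained sketch of the same steps, assuming hypothetical values for Year, data, and TDRState, and writing to an in-memory buffer instead of a file on disk so that it runs anywhere.

```python
import io

import pandas as pd

# Hypothetical stand-ins for the variables used in the example above.
data = pd.DataFrame({"State": ["OH", "OH", "KY", "KY"]})
Year = [2010, 2010, 2010, 2011]
TDRState = [5.0, 5.0, 7.5, 7.5]

submit = pd.DataFrame({"YYYY": Year, "State": data["State"], "TDRState": TDRState})

# Rows 0 and 1 agree on all three columns, so only the first of them is kept.
submit = submit.drop_duplicates(subset=["State", "TDRState", "YYYY"], keep="first")

# Write the CSV to an in-memory buffer; pass a file path instead to save to disk.
buffer = io.StringIO()
submit.to_csv(buffer, index=False)
csv_text = buffer.getvalue()
print(csv_text)
```

Because subset here names every column of submit, the call behaves the same as submit.drop_duplicates() with no arguments; listing the columns only makes the intent explicit.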
