Picking out the duplicated entries in a pandas DataFrame

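import pandas as pd  # df is assumed to be an existing DataFrame with a 微博id column
a = df.drop_duplicates(subset=['微博id'], keep='first')  # one row per id (its first occurrence)
b = df.drop_duplicates(subset=['微博id'], keep=False)    # only ids that occur exactly once
# pd.concat is used here because DataFrame.append was removed in pandas 2.0
f = pd.concat([a, b]).drop_duplicates(subset=['微博id'], keep=False)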

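This picks out the duplicated values in the 微博id (Weibo ID) column of the DataFrame. Rows whose id occurs only once appear in both a and b, so they cancel out in the last step; f is left holding the first row of each id that occurs more than once.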
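
The three-step approach above keeps only one row per duplicated 微博id. If every duplicated row is wanted, a boolean mask from `DataFrame.duplicated` with `keep=False` is a shorter route. The following is a minimal sketch on made-up data: the `text` column and all values are assumptions for illustration, and only the 微博id column name comes from the post.

```python
import pandas as pd

# Hypothetical sample data; only the column name 微博id comes from the post.
df = pd.DataFrame({
    "微博id": [1, 2, 2, 3, 3, 3, 4],
    "text": ["a", "b", "c", "d", "e", "f", "g"],
})

# The approach from the post, written with pd.concat.
a = df.drop_duplicates(subset=["微博id"], keep="first")
b = df.drop_duplicates(subset=["微博id"], keep=False)
f = pd.concat([a, b]).drop_duplicates(subset=["微博id"], keep=False)

# Alternative: mark every row whose 微博id occurs more than once.
all_dupes = df[df.duplicated(subset=["微博id"], keep=False)]

# Reducing the mask result to one row per id gives the same rows as f.
first_of_each = all_dupes.drop_duplicates(subset=["微博id"], keep="first")

print(f)
print(all_dupes)
```

Here `f` and `first_of_each` contain one row for id 2 and one for id 3, while `all_dupes` contains all five rows that share those ids.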

Reposted from blog.csdn.net/qq_40920203/article/details/104804705