Learning immediately: https://edu.csdn.net/course/play/26990/361133?utm_source=blogtoedu
1, the processing is repeated values
df[df.duplicated()]
np.sum(df.duplicated())
df.drop_duplicates()
df.drop_duplicates(subset= ['appname','size'],inplace=True)
To $ 8,000 and the $ ','
def f(x):
if '$' in str(x):
x = str(x).strip('$')
x = str(x).replace(',','')
else:
x = str(x).replace(',','')
return float(x)