Python的pandas简单操作


import numpy as np
import pandas as pd
titanic_survival=pd.read_csv('titanic_train.csv')
new_titanic_survival=titanic_survival.sort_values("Age",ascending = False)排序升序
print ( new_titanic_survival).head()
age_null_False = age[age_is_null==False]传递缺失值
c = titanic_survival ['Age'][age_is_null ==False]
mean = titanic_survival ['Age'].mean()均值
passenger_survival = titanic_survival.pivot_table(index = 'Pclass',values['Survived','Fare'],aggfunc = np.mean)两者关系求 平均
drop_na_columns=titanic_survival.dropna(axis=1)丢掉
new_titanic_survival = titanic_survival.dropna(axis = 0,subset = ['Sex','Age'])
new_titanic_survival=titanic_survival.sort_values('Age',ascending = False)
itanic_reindexed=titanic_survival.reset_index(drop = True)
def hundredth(columns):
    hundredth_item = columns.iloc[99]
    return  hundredth_item
hundredth=titanic_survival.apply(hundredth)
print (hundredth)函数
series_custom = Series(rt_scores,index=film_names)
series_custom[['Minions(2015)','Leviathan (2014)']]值与索引
sc2 = series_custom.sort_index()索引排序
rt_critics=Series(fandango['RottenTomatoes'].values, index=fandango['FILM'])
rt_users=Series(fandango['RottenTomatoes_User'].values, index=fandango['FILM'])
相同索引 不同列的值

猜你喜欢

转载自blog.csdn.net/weixin_41701299/article/details/79354514