Aims
- Download data on 801 Pokémon and select 10 Pokémon suitable for training by analyzing their 41 attributes
Dependencies
- python3
- pandas
- matplotlib
- seaborn
Dataset download
!wget -O pokemon_data.csv https://pai-public-data.oss-cn-beijing.aliyuncs.com/pokemon/pokemon.csv
Import data
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt
df = pd.read_csv("./pokemon_data.csv")
Data analysis
df.head()
df.info()
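As a quick sanity check, the shape of the loaded table can be compared against the 801 rows and 41 columns stated in the aims. The helper below is only a minimal sketch: `check_shape` is a hypothetical name, not a pandas function, and the tiny `demo` frame stands in for the real data.

```python
import pandas as pd

def check_shape(frame, expected_rows, expected_cols):
    """Return True if the DataFrame has the expected number of rows and columns."""
    rows, cols = frame.shape
    return rows == expected_rows and cols == expected_cols

# Small stand-in frame for illustration only (hypothetical values).
demo = pd.DataFrame({"name": ["Bulbasaur", "Ivysaur"], "hp": [45, 60]})
print(check_shape(demo, 2, 2))
```

With the real dataset the call would be `check_shape(df, 801, 41)`, assuming the downloaded file matches the description above.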
# Compute the percentage of missing values for each feature
percent_missing = df.isnull().sum() * 100 / len(df)
missing_value_df = pd.DataFrame({
    'column_name': df.columns,
    'percent_missing': percent_missing
})
# Show the 10 attributes with the highest percentage of missing values
missing_value_df.sort_values(by='percent_missing', ascending=False).head(10)
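The missing-value steps above can also be wrapped in a small reusable function. This is a sketch under stated assumptions, not the notebook's own method: `missing_summary` and its `top_n` parameter are hypothetical names, and the `demo` frame is made-up stand-in data. It uses `isnull().mean()`, which gives the same fraction as dividing the null count by the number of rows.

```python
import pandas as pd

def missing_summary(frame, top_n=10):
    """Percentage of missing values per column, highest first."""
    # The mean of a boolean mask is the fraction of True (missing) entries.
    percent = frame.isnull().mean() * 100
    summary = (
        percent.rename("percent_missing")
        .rename_axis("column_name")
        .reset_index()
    )
    return summary.sort_values("percent_missing", ascending=False).head(top_n)

# Stand-in data for illustration only (hypothetical values).
demo = pd.DataFrame({
    "name": ["Bulbasaur", "Charmander", "Squirtle", "Pikachu"],
    "type2": ["poison", None, None, None],
    "height_m": [0.7, 0.6, None, 0.4],
})
result = missing_summary(demo)
print(result)
```

On the real data, `missing_summary(df)` should list the same ten attributes as the cell above. The only difference is that the column names appear once, as a regular column, instead of as both the index and a column.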