quantile()
Location determined quartile two methods, method = POS. 1 (n-+. 1) P = 1+ POS Method 2 (. 1-n-) P
pandas method used is determined. Linear interpolation is used by default
In [213]:
df
Out[213]:
In [217]:
df.
quantile(0.1)
Out[217]:
In [ ]:
# Default using linear interpolation
# Data1 column
# Pos = 1 + (5-1) * 0.1 = 1.4 I = 0.4, -1.700586 + (- 1.337130 - (- 1.700586)) * 0.4 = -1.555204
In [229]:
DF. Quantile ([ 0.05, 0.95]) # Note brackets
Out[229]:
In [260]:
def cap_outliers(ser,lower,higher):
low,high=ser.quantile([lower,higher])
ser[ser<low]=low
ser[ser>high]=high
return (ser)
cap_outliers(df['data1'],0.05,0.95)
Out[260]: