Assign different aggregation functions to different features in pandas groupby - Code World

Others 2022-08-19 12:03:15 views: 0

Arch Desai :

I have data like the following (my real data has 100 columns rather than 4):

import pandas as pd

raw_data = {
    'age': [52, 52, 24, 24, 24],
    'a': [4, 24, 31, 2, 3],
    'b': [3, 2, 3, 4, 3],
    'c': [2, 5, 8, 2, 1],
}
df = pd.DataFrame(raw_data, columns=['age', 'a', 'b', 'c'])

which results in

    age a   b   c
0   52  4   3   2
1   52  24  2   5
2   24  31  3   8
3   24  2   4   2
4   24  3   3   1

I want to group the data by age, then take the mean of some features and the sum of the remaining ones. I tried this, which does not work because a list cannot be used as a dictionary key:

feats = ['a', 'b']
df.groupby('age').agg({feats:['mean'], 'c':['sum']})

My real data has 100 features and several functions to assign (RMS, kurtosis, energy index, etc.), so mapping a function to each feature by hand is possible but very time-consuming. Is there a way to do this more concisely?

Scott Boston :

Use a dictionary comprehension to build the aggregation mapping:

agg_d = {i:'mean' for i in feats}
agg_d['c'] = 'sum'

df.groupby('age').agg(agg_d)

Output:

      a         b   c
age                  
24   12  3.333333  11
52   14  2.500000   7
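
With many columns, the dictionary can also be built from the column index instead of being typed out. The following is a minimal sketch, assuming that every column not listed for the mean (and not the grouping key) should be summed. The names mean_feats and sum_feats are illustrative and do not come from the answer above.

```python
import pandas as pd

df = pd.DataFrame({
    'age': [52, 52, 24, 24, 24],
    'a': [4, 24, 31, 2, 3],
    'b': [3, 2, 3, 4, 3],
    'c': [2, 5, 8, 2, 1],
})

mean_feats = ['a', 'b']
# Assumption: every remaining column except the grouping key gets 'sum'.
sum_feats = df.columns.difference(mean_feats + ['age'])

# One dictionary entry per column, built in two passes.
agg_d = {col: 'mean' for col in mean_feats}
agg_d.update({col: 'sum' for col in sum_feats})

result = df.groupby('age').agg(agg_d)
print(result)
```

With 100 columns only mean_feats needs to be maintained; the rest is derived from df.columns.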

Update: you can also apply multiple aggregation functions to a column by passing a list:

agg_d = {i:['sum','max','first', lambda x: sum(x**2)] for i in feats}
agg_d['c'] = 'sum'

df.groupby('age').agg(agg_d)
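
For custom statistics such as the RMS mentioned in the question, named functions can go in the list in place of a lambda, and they give the result columns readable labels. This is a rough sketch under some assumptions: rms and energy are hypothetical helpers written for this example (energy is simply a sum of squares, not necessarily the asker's "energy index"), and the final step that flattens the two-level column index is optional.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    'age': [52, 52, 24, 24, 24],
    'a': [4, 24, 31, 2, 3],
    'b': [3, 2, 3, 4, 3],
    'c': [2, 5, 8, 2, 1],
})

def rms(x):
    """Root mean square of the values in one group (hypothetical helper)."""
    return np.sqrt(np.mean(x ** 2))

def energy(x):
    """Sum of squares of the values in one group (hypothetical helper)."""
    return (x ** 2).sum()

feats = ['a', 'b']
# Each feature gets the same list of built-in and custom aggregations.
agg_d = {col: ['sum', 'max', rms, energy] for col in feats}
agg_d['c'] = ['sum']

result = df.groupby('age').agg(agg_d)
# Flatten the (column, function) pairs into single names such as 'a_rms'.
result.columns = ['_'.join(pair) for pair in result.columns]
print(result)
```

Passing a list for any column makes the result columns a two-level index, which is why the flattening step can be convenient before further processing.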
