基尼辛普森指数衡量多样性 - 代码天地

基尼辛普森指数衡量多样性

其他 2021-11-19 09:27:24 阅读次数: 0

Simpson index

$\lambda =\sum_{i=1}^{R}p_{i}^{2}$

The measure equals the probability that two entities taken at random from the dataset (with replacement) represent the same type, where $R$ is the total number of types in the dataset.

Gini–Simpson index

The transformation $1-\lambda$ equals the probability that the two entities represent different types.

分布越均衡，该指数越高；分布越集中，该指数越低。

Code

import pandas as pd

def gini_calc(df2):
    sum_ = sum_square = 0
    sum_ = df2['cnt'].sum()
    df2['cnt_prop']=df2['cnt'].apply(lambda x :x/sum_)
    for i in df2['cnt_prop']:
        sum_square += i**2
    return 1-sum_square


################################
df = pd.read_excel('gini.xlsx')
df=df.groupby([df['population'],df['subpopulation'],df['type']],as_index=False).sum()


################################
a=[]
b=[]
c=[]
for name,group in df.groupby([df['population'],df['subpopulation']]):
    index = gini_calc(group)
    a.append(name[0])
    b.append(name[1])
    c.append(index)
 
res={"population":a, "subpopulation":b, "gini_simpson_index":c}
data=pd.DataFrame(res)
result=data.to_csv('gini_result.csv')

猜你喜欢

转载自blog.csdn.net/qq_34276652/article/details/113736686

基尼辛普森指数衡量多样性

多样性指数区别

推荐系统多样性指标衡量

β多样性算法

物种多样性学习之Beta多样性

物种多样性学习之Alpha多样性

基尼指数

基尼Gini指数

生物多样性概念

编码标准的多样性

基尼值和基尼指数

Android的屏幕多样性支持

物种多样性学习 1

基因多样性与多态信息含量

图片的多样性之模式崩溃

Biodiversity Project ：生活多样性项目

R语言计算β多样性

大模型训练数据多样性的重要性

c语言计算基尼指数

Alpha多样性之箱线图绘制

Alpha多样性之箱线图解读

Array对象的多样性。面试题

LiveVideoStackCon 2018展现多媒体技术生态多样性

集成学习-多样性的度量和增强

利用metaphlan2结果计算alpha多样性

文献综述：多样性推荐算法的定义及优化方法

computer planetary——全球生物多样性信息机构 (GBIF)

兴趣探测的多样性解决方案

机器视觉软件开发的多样性

FID与LPIPS等图像质量与多样性指标

今日推荐

中国码农的“35岁魔咒”

蘭雅 CorelDRAW 插件 2024.5.1 国际劳动节版，免费下载

Arc Browser for Windows 1.0 正式 GA

90后程序员开发视频搬运软件、不到一年获利超 700 万，结局很刑！

《美国对全球网络空间安全与发展的威胁和破坏》报告发布

周排行

[编程题]学英语

[codeforces 1288A] Deadline 约数+模

Python的web开发

Docker在Centos 7上的部署

python编码

解决Ubuntu16.04 fatal error: json/json.h: No such file or directory

mysql并发插入

rest接口如何适应jsonp的方案

linux 终端上网设置

高数——等号两边同时求导、积分的解释

每日归档

更多

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)

2024-04-30(1)

2024-04-29(40)

2024-04-28(0)

2024-04-27(56)

2024-04-26(39)

2024-04-25(22)