Clustering evaluation metric: Rand Index

Suppose we have a set of N articles.

These articles form N(N-1)/2 pairs, and each pair falls into one of four cases:

TP: two articles of the same class are assigned to the same cluster

TN: two articles of different classes are assigned to different clusters

FP: two articles of different classes are assigned to the same cluster

FN: two articles of the same class are assigned to different clusters

The Rand Index measures the percentage of pairs that are classified correctly:

RI = (TP+TN)/(TP+FP+FN+TN)
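
The four pair counts and the RI formula above can be sketched in a few lines of Python. This is only a minimal illustration, not code from the original post: the function names `pair_counts` and `rand_index` and the toy labels are assumptions made for the example.

```python
from itertools import combinations

def pair_counts(classes, clusters):
    """Count TP, FP, FN, TN over all N(N-1)/2 pairs of items."""
    tp = fp = fn = tn = 0
    for i, j in combinations(range(len(classes)), 2):
        same_class = classes[i] == classes[j]
        same_cluster = clusters[i] == clusters[j]
        if same_class and same_cluster:
            tp += 1   # same class, same cluster
        elif same_cluster:
            fp += 1   # different classes, same cluster
        elif same_class:
            fn += 1   # same class, different clusters
        else:
            tn += 1   # different classes, different clusters
    return tp, fp, fn, tn

def rand_index(classes, clusters):
    tp, fp, fn, tn = pair_counts(classes, clusters)
    return (tp + tn) / (tp + fp + fn + tn)

# Hypothetical toy data: 4 articles, 2 true classes, 2 clusters.
classes = ["a", "a", "b", "b"]
clusters = [0, 0, 0, 1]
print(pair_counts(classes, clusters))  # (1, 2, 1, 2)
print(rand_index(classes, clusters))   # 0.5
```

Enumerating every pair costs O(N^2) time, which is fine for small sets; for large N the counts are usually derived from a contingency table instead.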


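In the worked example, 40 pairs are assigned to the same cluster (TP + FP = 40), and 20 of them are true positives. Thus, FP = 40 - 20 = 20.

Similarly, 96 pairs are assigned to different clusters (FN + TN = 96), and 24 of them are false negatives.

Accordingly, TN = 96 - 24 = 72.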

 

                     Same cluster    Different clusters
Same class           TP = 20         FN = 24
Different classes    FP = 20         TN = 72

RI = (20 + 72) / (20 + 20 + 24 + 72) = 0.68

 

Precision=TP/(TP+FP)

Recall=TP/(TP+FN)

F1 = 2×Recall×Precision/(Recall+Precision)

 

Precision=20/40=0.5

Recall=20/44=0.455

F1 = (2 * 0.5 * 0.455) / (0.5 + 0.455) = 0.48
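
The same arithmetic can be checked with a short Python sketch. The helper name `precision_recall_f1` is an assumption made for this illustration; the counts TP = 20, FP = 20, FN = 24 are the ones from the example above.

```python
def precision_recall_f1(tp, fp, fn):
    """Pairwise precision, recall and F1 from the pair counts."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1

# Counts taken from the worked example.
p, r, f1 = precision_recall_f1(tp=20, fp=20, fn=24)
print(round(p, 3), round(r, 3), round(f1, 3))  # 0.5 0.455 0.476
```

Note that F1 rounds to 0.48 at two decimal places, matching the value above.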


Origin blog.csdn.net/ws_developer/article/details/50751065