sklearn stepping on confusion_matrix and KFold - Code World

sklearn stepping on confusion_matrix and KFold

Enterprise 2023-07-12 06:39:08 views: null

sklearn.metrics.confusion_matrix(y_true, y_pred, labels=None, sample_weight=None)

Although sklearn.metrics.confusion_matrix is very convenient to get the confusion matrix directly, but because I found a problem after practice: sklearn's confusion matrix implementation will automatically reduce the dimension to 1*1 when there is only one class , so I made one by myself:

def calculate_metric(gt, pred): 
    pred[pred>0.5]=1
    pred[pred<1]=0
    TP, FP, TN, FN = 0, 0, 0, 0

    for i in range(len(gt)):
        if gt[i] == 1 and pred[i] == 1:
           TP += 1
        if gt[i] == 0 and pred[i] == 1:
           FP += 1
        if gt[i] == 0 and pred[i] == 0:
           TN += 1
        if gt[i] == 1 and pred[i] == 0:
           FN += 1

    # confusion = confusion_matrix(gt,pred)
    # print(confusion.shape)
    # TP = confusion[1,1]
    # TN = confusion[0,0]
    # FP = confusion[0,1]
    # FN = confusion[1,0]

    return TP, FP, TN, FN

But later it was found that the confusion_matrix was not to blame for the problem. In fact, it was because of the small number of data samples, which led to the fact that each fold in the 50-fold cross-validation cannot guarantee that each fold contains two types of samples. Fortunately:

So I went to understand KFold and StratifiedKFold, the usage of both is the same, just modify the function name:

from sklearn.model_selection import train_test_split, KFold, StratifiedKFold


kf = KFold(n_splits=5,random_state=2023,shuffle=True)

kf = StratifiedKFold(n_splits=5,random_state=2023,shuffle=True)

But there are some differences between the two when doing split

#KFold不需要传入标签
for train_index, validate_index in kf.split(dataset):  
    pass
#StratifiedKFold需要传入标签
for train_index, validate_index in kf.split(dataset,dataset['label']):  
    pass

Switching to StratifiedKFold stratified sampling will not have one fold and only one class, thus avoiding the dimensionality reduction problem of confusion_matrix, so far the problem is solved.

Guess you like

Origin blog.csdn.net/weixin_48144018/article/details/129663521

sklearn stepping on confusion_matrix and KFold

sklearn pisando confusion_matrix y KFold

sklearn tritt auf confusion_matrix und KFold

Detailed explanation of CV and KFold in Sklearn

ML's sklearn: a detailed guide to the explanation of the commonly used function parameters (such as confusion_matrix, etc.) in sklearn.metrics and their usage instructions

sklearn calculates confusion matrix

Sklearn.metrics evaluation method described in (accuracy_score, recall_score, roc_curve, roc_auc_score, confusion_matrix, classification_report)

sklearn K-fold cross validation function using KFold

sklearn.metrics.multilabel_confusion_matrix

Use python to draw confusion matrix (confusion_matrix)

模型评估——混淆矩阵confusion_matrix

sklearn К-кратная кросс функцию проверки с помощью KFold

sklearn наступает на путаницу_матрицу и KFold

Discrepancy between KFlold on the one hand and KFold with shuffle=True and RepeatedKFold on the other hand in sklearn

sklearnがconstruction_matrixとKFoldを踏む

sklearn classifier evaluation indicators (precision rate, confusion matrix, precious-recall-Fmeasur, ROC curve, loss function)

python confusion matrix (confusion_matrix) FP, FN, TP, TN, accuracy rate (Precision), recall (Recall), accuracy (Accuracy) detailed

python: multi-classification-calculate confusion matrix confusion_matrix, precision, recall, f1-score score

Расхождение между KFlold с одной стороны, и KFold с тасованью = True и RepeatedKFold с другой стороны, в sklearn

KFold cross validation

Введение в три метода нормализации и денормализации в библиотеке sklearn

sklearn은 chaos_matrix 및 KFold를 밟고 있습니다.

Путаница матрица (confusion_matrix) Значение

[Machine learning notes] confusion matrix (Confusion Matrix)

Cross-validation KFold k-

Stepping eclipse

Clear and concise confusion matrix

python confusion matrix template

Plot the confusion matrix

Matlab draw confusion matrix

Recommended

Ranking

#2019110700005

What materials and procedures are required for patent transfer

What is the blockchain Ethereum triplet state root transaction root receipt root

Front-end study notes 04 --- About the insertion of html pictures and videos

Documents required for the filing of WeChat Mini Programs in special industries, the filing process of WeChat Mini Programs in special industries, how to file WeChat Mini Programs in special industries

2017 Qingdao-site tournament I The Squared Mosquito Coil

[BZOJ3165][HEOI2013]Segment (line segment tree without marking)

Kettle series: KettleEasyExpand, an open source Kettle universal plugin by Ma Jinju

The latest tutorial on making framework for iOS

DAX Section 6: Statistical Functions

Daily

More

2024-05-14(9)

2024-05-13(8)

2024-05-12(28)

2024-05-11(32)

2024-05-10(34)

2024-05-09(32)

2024-05-08(18)

2024-05-07(34)

2024-05-06(6)

2024-05-05(0)