The Matthews correlation coefficient (MCC) is more reliable than balanced accuracy, bookmaker informedness, and markedness in two-class confusion matrix evaluation

Davide Chicco, Giuseppe Jurman, Niklas Tötsch

doi:10.1186/s13040-021-00244-z

Abstract

Evaluating binary classifications is a pivotal task in statistics and machine learning, because it can influence decisions in multiple areas, including, for example, prognosis or therapies of patients in critical conditions. The scientific community has not yet agreed on a general-purpose statistical indicator for evaluating two-class confusion matrices (having true positives, true negatives, false positives, and false negatives), even though advantages of the Matthews correlation coefficient (MCC) over accuracy and F1 score have already been shown.

In this manuscript, we reaffirm that MCC is a robust metric that summarizes the classifier performance in a single value, if positive and negative cases are of equal importance. We compare MCC to other metrics which value positive and negative cases equally: balanced accuracy (BA), bookmaker informedness (BM), and markedness (MK). We explain the mathematical relationships between MCC and these indicators, then show some use cases and a bioinformatics scenario where these metrics disagree and where MCC generates a more informative response.

Additionally, we describe three exceptions where BM can be more appropriate: analyzing classifications where dataset prevalence is unrepresentative, comparing classifiers on different datasets, and assessing the random guessing level of a classifier. Except in these cases, we believe that MCC is the most informative among the single metrics discussed, and suggest it as a standard measure for scientists of all fields. A Matthews correlation coefficient close to +1, in fact, means having high values for all the other confusion matrix metrics. The same cannot be said for balanced accuracy, markedness, bookmaker informedness, accuracy, and F1 score.
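To make the four metrics named in the abstract concrete, here is a minimal Python sketch that computes BA, BM, MK, and MCC from the four confusion matrix counts using their standard textbook definitions. The function name `confusion_metrics`, the example counts, and the choice to raise an error on an empty row or column are illustrative assumptions, not taken from the paper.

```python
import math


def confusion_metrics(tp, tn, fp, fn):
    """Return BA, BM, MK and MCC for a two-class confusion matrix.

    Hypothetical helper: assumes every row and column total is non-zero.
    """
    if 0 in (tp + fn, tn + fp, tp + fp, tn + fn):
        raise ValueError("each row and column of the matrix must be non-empty")

    tpr = tp / (tp + fn)  # sensitivity (true positive rate)
    tnr = tn / (tn + fp)  # specificity (true negative rate)
    ppv = tp / (tp + fp)  # precision (positive predictive value)
    npv = tn / (tn + fn)  # negative predictive value

    ba = (tpr + tnr) / 2   # balanced accuracy
    bm = tpr + tnr - 1     # bookmaker informedness
    mk = ppv + npv - 1     # markedness
    mcc = (tp * tn - fp * fn) / math.sqrt(
        (tp + fp) * (tp + fn) * (tn + fp) * (tn + fn)
    )
    return {"BA": ba, "BM": bm, "MK": mk, "MCC": mcc}


# Made-up imbalanced example: 10 positives, 990 negatives.
scores = confusion_metrics(tp=9, tn=900, fp=90, fn=1)
for name, value in scores.items():
    print(f"{name}: {value:.3f}")
```

In this made-up example, BA and BM are high because both sensitivity and specificity are high, while MK and MCC are low because most positive predictions are wrong. With these definitions, MCC squared equals the product of BM and MK, which is one way to see why a high MCC requires both to be high.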

Highlights

Evaluating the results of a binary classification remains an important challenge in machine learning and computational statistics
The evaluation of binary classifications is an important step in machine learning and statistics, and the four-category confusion matrix has emerged as one of the most powerful and efficient tools to perform it
Since the advantages of Matthews correlation coefficient over accuracy and F1 score have been already unveiled in the past [15], in this study we decided to compare MCC with balanced accuracy, bookmaker informedness, and markedness, by exploring their mathematical relationships and by analyzing some use cases

Summary

Introduction

Evaluating the results of a binary classification remains an important challenge in machine learning and computational statistics. Every time researchers use an algorithm to discriminate the elements of a dataset having two conditions (for example, positive and negative), they can generate a contingency table called a two-class confusion matrix, representing how many elements were correctly predicted and how many were wrongly classified [1,2,3,4,5,6,7,8]. Best practice suggests computing the confusion matrices for all the possible cut-offs. These confusion matrices can be used to generate a receiver operating characteristic (ROC) curve [9] or a precision-recall (PR) curve [10]. The area under the curve (AUC) ranges between 0 and 1: the closer to 1, the better the binary classification.
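The idea of computing one confusion matrix per cut-off can be sketched in a few lines of Python. The snippet below is an illustration only: it assumes labels coded as 0/1, treats a score at or above the cut-off as a positive prediction, builds the ROC curve (not the PR curve), and integrates it with the trapezoidal rule. The names `roc_points` and `auc` and the toy data are hypothetical.

```python
def roc_points(y_true, y_score):
    """Return (FPR, TPR) pairs, one per distinct cut-off, plus the origin."""
    positives = sum(y_true)
    negatives = len(y_true) - positives
    points = [(0.0, 0.0)]
    # Sweep cut-offs from the highest score to the lowest.
    for cutoff in sorted(set(y_score), reverse=True):
        tp = sum(1 for y, s in zip(y_true, y_score) if s >= cutoff and y == 1)
        fp = sum(1 for y, s in zip(y_true, y_score) if s >= cutoff and y == 0)
        points.append((fp / negatives, tp / positives))
    return points


def auc(points):
    """Area under a piecewise-linear curve via the trapezoidal rule."""
    return sum(
        (x1 - x0) * (y0 + y1) / 2
        for (x0, y0), (x1, y1) in zip(points, points[1:])
    )


# Toy data: two negatives and two positives with made-up scores.
labels = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]
area = auc(roc_points(labels, scores))
print(f"ROC AUC: {area:.2f}")
```

Each cut-off yields its own true positive and false positive counts, so a single classifier produces a whole family of confusion matrices. This is why a single-matrix metric such as MCC always refers to one chosen cut-off.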

Journal: BioData Mining
Publication Date: Feb 4, 2021
Citations: 484
License type: open-access
