The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation

Davide Chicco, Giuseppe Jurman

doi:10.1186/s12864-019-6413-7

Open Access

Abstract

Background: To evaluate binary classifications and their confusion matrices, scientific researchers can employ several statistical rates, according to the goal of the experiment they are investigating. Despite this being a crucial issue in machine learning, no widespread consensus has yet been reached on a single measure of choice. Accuracy and F1 score computed on confusion matrices have been (and still are) among the most popular metrics in binary classification tasks. However, these statistical measures can show dangerously overoptimistic, inflated results, especially on imbalanced datasets.

Results: The Matthews correlation coefficient (MCC), instead, is a more reliable statistical rate that produces a high score only if the prediction obtained good results in all four confusion matrix categories (true positives, false negatives, true negatives, and false positives), proportionally both to the size of positive elements and the size of negative elements in the dataset.

Conclusions: In this article, we show how MCC produces a more informative and truthful score in evaluating binary classifications than accuracy and F1 score, by first explaining its mathematical properties, and then the advantages of MCC in six synthetic use cases and in a real genomics scenario. We believe that the Matthews correlation coefficient should be preferred to accuracy and F1 score in evaluating binary classification tasks by all scientific communities.

Highlights

To evaluate binary classifications and their confusion matrices, scientific researchers can employ several statistical rates, according to the goal of the experiment they are investigating
When a confusion matrix threshold is available, we recommend using the Matthews correlation coefficient rather than F1 score and accuracy
We decided to focus on accuracy and F1 score because they are the most common metrics used for binary classification in machine learning

Summary

Introduction

To evaluate binary classifications and their confusion matrices, scientific researchers can employ several statistical rates, according to the goal of the experiment they are investigating. Despite this being a crucial issue in machine learning, no widespread consensus has yet been reached on a single measure of choice. Accuracy and F1 score computed on confusion matrices have been (and still are) among the most popular metrics in binary classification tasks. These statistical measures can show dangerously overoptimistic, inflated results, especially on imbalanced datasets. Answering binary classification questions is the aim of machine learning and computational statistics, nowadays pervasive in the analysis of biological and health care datasets. Typical cases include the application of machine learning methods to microarray gene expressions [10] or to single-nucleotide polymorphisms (SNPs) [11] to classify particular conditions of patients. Several consolidated and well-known facts drive the choice of evaluation measures in current practice.
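
The contrast between the three metrics can be illustrated with a minimal Python sketch. It uses the standard definitions of accuracy, F1 score, and MCC computed from the four confusion matrix counts. The function names and the example counts are made up for illustration and are not taken from the article; returning 0.0 when the MCC denominator is zero is one common convention, assumed here.

```python
import math


def accuracy(tp, fn, tn, fp):
    """Fraction of all samples that were classified correctly."""
    return (tp + tn) / (tp + fn + tn + fp)


def f1_score(tp, fn, tn, fp):
    """Harmonic mean of precision and recall; note that it ignores tn."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def mcc(tp, fn, tn, fp):
    """Matthews correlation coefficient, ranging from -1 to +1."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        # Assumed convention: score 0.0 when a row or column of the
        # confusion matrix is empty and the coefficient is undefined.
        return 0.0
    return (tp * tn - fp * fn) / denom


# Hypothetical imbalanced dataset: 95 positives and 5 negatives.
# The classifier finds most positives but only 1 of the 5 negatives.
counts = dict(tp=90, fn=5, tn=1, fp=4)

print(f"accuracy = {accuracy(**counts):.3f}")
print(f"F1 score = {f1_score(**counts):.3f}")
print(f"MCC      = {mcc(**counts):.3f}")
```

With these counts, accuracy and F1 score both exceed 0.9 while MCC stays below 0.2, because MCC also accounts for the poor performance on the small negative class.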

Journal: BMC Genomics	Publication Date: Jan 2, 2020
Citations: 3032	License type: open-access

Similar Papers

The Matthews correlation coefficient (MCC) is more reliable than balanced accuracy, bookmaker informedness, and markedness in two-class confusion matrix evaluation
Davide Chicco ... Niklas Tötsch
BioData Mining | VOL. 14
04 Feb 2021

The Matthews correlation coefficient (MCC) should replace the ROC AUC as the standard metric for assessing binary classification
Davide Chicco ... Giuseppe Jurman
BioData Mining | VOL. 16
17 Feb 2023

Diagnostic Accuracy of Web-Based COVID-19 Symptom Checkers: Comparison Study
Nicolas Munsch ... Stefanie Gruarin
Journal of Medical Internet Research | VOL. 22
06 Oct 2020

Integrating near-infrared hyperspectral imaging with machine learning and feature selection: Detecting adulteration of extra-virgin olive oil with lower-grade olive oils and hazelnut oil
Derick Malavi ... Sam Van Haute
Current Research in Food Science | VOL. 9
01 Jan 2024
