The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets

Takaya Saito,Marc Rehmsmeier

doi:10.1371/journal.pone.0118432

Abstract

Binary classifiers are routinely evaluated with performance measures such as sensitivity and specificity, and performance is frequently illustrated with Receiver Operating Characteristics (ROC) plots. Alternative measures such as positive predictive value (PPV) and the associated Precision/Recall (PRC) plots are used less frequently. Many bioinformatics studies develop and evaluate classifiers that are to be applied to strongly imbalanced datasets in which the number of negatives outweighs the number of positives significantly. While ROC plots are visually appealing and provide an overview of a classifier's performance across a wide range of specificities, one can ask whether ROC plots could be misleading when applied in imbalanced classification scenarios. We show here that the visual interpretability of ROC plots in the context of imbalanced datasets can be deceptive with respect to conclusions about the reliability of classification performance, owing to an intuitive but wrong interpretation of specificity. PRC plots, on the other hand, can provide the viewer with an accurate prediction of future classification performance due to the fact that they evaluate the fraction of true positives among positive predictions. Our findings have potential implications for the interpretation of a large number of studies that use ROC plots on imbalanced datasets.

Highlights

Binary classifiers are statistical and computational models that divide a dataset into two groups, positives and negatives
Through the Results section, we aim to show how evaluation measures act under imbalanced datasets from several different perspectives
Receiver Operating Characteristics (ROC) is a popular and strong measure to evaluate the performance of binary classifiers

Summary

Introduction

Binary classifiers are statistical and computational models that divide a dataset into two groups, positives and negatives. They have been successfully applied to a wide range of biological and medical problems in recent years [1,2,3]. Used measures of classifier performance in the phase of model construction are accuracy, error rate, and the Area under the Receiver Operating Characteristics (ROC) curve (AUC) [4]. Various additional measures are useful for the evaluation of the final model, and several plots provide visual representations, such as ROC and Precision-Recall (PRC) plots [5].

Objectives

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PLOS ONE	Publication Date: Mar 4, 2015
Citations: 2696	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLOS ONE

Lead the way for us

Similar Papers

The ROC Diagonal is Not Layperson’s Chance: A New Baseline Shows the Useful Area
André M Carrington ... Nick D James
-
André M Carrington, et. al.André M Carrington ... Nick D James
01 Jan 2021
01 Jan 2021

Comparison of tests of stress-released cortisol secretion in pituitary disease.
S M Orme ... S R Peacey
Clinical endocrinology | VOL. 45
S M Orme, et. al.S M Orme ... S R Peacey
01 Aug 1996
Clinical endocrinology | VOL. 45

A new concordant partial AUC and partial c statistic for imbalanced data in the evaluation of machine learning algorithms
André M Carrington ... Franz Mayr
BMC Medical Informatics and Decision Making | VOL. 20
André M Carrington, et. al.André M Carrington ... Franz Mayr
06 Jan 2020
BMC Medical Informatics and Decision Making | VOL. 20

Identification of metabolic bone disease in patients with endogenous hyperthyroidism: role of biological markers of bone turnover.
E Jódar Gimeno ... J D Luna Del Castillo
Calcified tissue international | VOL. 61
E Jódar Gimeno, et. al.E Jódar Gimeno ... J D Luna Del Castillo
01 Nov 1997
Calcified tissue international | VOL. 61

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLOS ONE