Comparison of two classifiers when the data sets are imbalanced: the power of the area under the precision-recall curve as the figure of merit versus the area under the ROC curve

Berkman Sahiner, Weijie Chen, Nicholas Petrick, Aria Pezeshk

doi:10.1117/12.2254742

Abstract

In many two-class problems in automated classification and information retrieval, the classes are imbalanced, and the separation between the positive and negative classes is large. The precision-recall (PR) curve has been suggested as an alternative to the receiver operating characteristic (ROC) curve for characterizing the performance of automated systems when the classes are imbalanced, and the area under the precision-recall curve (AUCPR) has been suggested as an alternative performance measure to the area under the ROC curve (AUCROC). AUCPR and AUCROC are distinct measures of performance, even though the relationship between the precision-recall and ROC curves is well known. In this study, we compared the statistical power of AUCPR with that of AUCROC as the figure of merit when comparing two classifiers. Our results indicate that AUCPR can offer a small statistical advantage when the prevalence is low and the separation between the positive and negative classes is large. When the data set is more balanced, or the separation between the classes is low or moderate, AUCROC has slightly higher power.
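To make the two figures of merit concrete, the following is a minimal sketch in Python of how each might be estimated from classifier scores. It is an illustration only, not the authors' procedure: the function names `auc_roc`, `auc_pr`, and `simulate` are hypothetical, AUCROC is estimated with the Mann-Whitney statistic, AUCPR is approximated by average precision (one of several possible estimators, and an assumption here), and the simulated scores assume unit-variance normal distributions whose means differ by `separation`.

```python
import random

def auc_roc(pos, neg):
    """Mann-Whitney estimate of AUCROC: P(positive score > negative score).
    Ties between a positive and a negative score count as 1/2."""
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))

def auc_pr(pos, neg):
    """Average precision, used here as an approximation of AUCPR (assumption).
    Precision is evaluated at the rank of each positive case and averaged.
    Tied scores are ordered as listed (positives first), which is optimistic."""
    ranked = sorted([(s, 1) for s in pos] + [(s, 0) for s in neg],
                    key=lambda t: -t[0])
    true_pos = 0
    total = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label:
            true_pos += 1
            total += true_pos / rank  # precision at this recall level
    return total / len(pos)

def simulate(n_pos, n_neg, separation, rng):
    """Hypothetical score model: negatives ~ N(0, 1), positives ~ N(separation, 1)."""
    pos = [rng.gauss(separation, 1.0) for _ in range(n_pos)]
    neg = [rng.gauss(0.0, 1.0) for _ in range(n_neg)]
    return pos, neg

if __name__ == "__main__":
    rng = random.Random(0)
    # Low prevalence (2% positives) with a large class separation.
    pos, neg = simulate(n_pos=20, n_neg=980, separation=2.0, rng=rng)
    print("AUCROC:", auc_roc(pos, neg))
    print("AUCPR (average precision):", auc_pr(pos, neg))
```

A power comparison along the lines described in the abstract would repeat such a simulation many times for two classifiers and count how often a test on each figure of merit detects the difference. That step is not shown here, since the abstract does not specify the test or the simulation design.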

