How reliable is your reliability diagram?

Hyukjun Gweon,Hao Yu

doi:10.1016/j.patrec.2019.07.012

Abstract

It is often necessary to evaluate probabilistic classifiers in terms of the quality of class probability estimates. A popular tool for assessing class probabilities is the reliability diagram, which is based on data binning. While the reliability diagram is visually appealing, it is difficult to statistically determine whether the probabilities are reliable. In this paper, we propose a standardized reliability diagram to assess a binary probabilistic classifier. The proposed method uses the transforms of the Poisson binomial distribution to the normal distribution. The results of the method provide valuable inferences over the (unscaled) reliability diagram. Moreover, we show that the assessment results may be undesirably dependent on the sample size in each bin. As a remedy, we also introduce an approach that chooses an appropriate number of bins for relatively consistent test results regardless of the sample size. Simulation and example results demonstrate the effectiveness of the proposed approaches.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

How reliable is your reliability diagram?

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters

Lead the way for us

Journal: Pattern Recognition Letters	Publication Date: Jul 1, 2019
Citations: 6

Similar Papers

Confidence intervals for the reliability diagram
Jan Verkade
-
Jan VerkadeJan Verkade
15 May 2023
15 May 2023

Stable reliability diagrams for probabilistic classifiers
Timo Dimitriadis ... Alexander I Jordan
Proceedings of the National Academy of Sciences | VOL. 118
Timo Dimitriadis, et. al.Timo Dimitriadis ... Alexander I Jordan
17 Feb 2021
Proceedings of the National Academy of Sciences | VOL. 118

Direct estimation of class membership probabilities for multiclass classification using multiple scores
...
Knowledge and Information Systems | VOL. 19
, et. al. ...
09 Sep 2008
Knowledge and Information Systems | VOL. 19

Towards probabilistic multiclass classification of gamma-ray sources
...
arXiv (Cornell University) | VOL. -
, et. al. ...
21 Sep 2022
arXiv (Cornell University) | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

How reliable is your reliability diagram?

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition Letters