Abstract
In this paper, we study the post-hoc calibration of modern neural networks, a problem that has drawn considerable attention in recent years. Despite the plethora of calibration methods proposed, there is still no consensus on the inherent complexity of the task: while some authors claim that simple functions solve the problem, others suggest that more expressive models are needed to capture miscalibration. As a first approach, we focus on the task of confidence scaling, specifically on post-hoc methods that generalize Temperature Scaling, which we refer to as the Adaptive Temperature Scaling family. We begin by demonstrating that while complex models such as neural networks provide an advantage when data is ample, they fail when data is limited, a scenario common in fields such as medical diagnosis. We then show that, under ideal data conditions, the more expressive methods learn a relationship between the entropy of a prediction and its level of overconfidence. Based on this observation, we propose Entropy-based Temperature Scaling, a simple method that scales the confidence of a prediction according to its entropy. Results show that our method obtains state-of-the-art performance and is robust to data scarcity. Moreover, the proposed model enables a deeper understanding of the calibration process through the interpretation of entropy as a measure of uncertainty in the network outputs.
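To make the idea concrete, the sketch below illustrates how a per-sample temperature driven by prediction entropy could be applied to logits. The abstract does not specify the entropy-to-temperature mapping, so the linear mapping with parameters (a, b) used here is purely a hypothetical assumption, not the paper's actual parameterization.

```python
# Minimal sketch of entropy-based temperature scaling (illustrative only).
# Assumption not stated in the abstract: the entropy-to-temperature mapping is
# modeled as a linear function of the prediction entropy with two scalar
# parameters (a, b); the paper's actual parameterization may differ.
import numpy as np

def softmax(logits, axis=-1):
    z = logits - logits.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def entropy_temperature_scaling(logits, a, b):
    """Rescale logits with a per-sample temperature driven by prediction entropy."""
    probs = softmax(logits)                                # uncalibrated confidences
    ent = -(probs * np.log(probs + 1e-12)).sum(axis=-1)    # Shannon entropy per sample
    temperature = np.exp(a * ent + b)                      # hypothetical mapping, kept positive
    return softmax(logits / temperature[:, None])          # calibrated confidences

# Usage: parameters (a, b) would be fit on a held-out validation set,
# e.g. by minimizing the negative log-likelihood of the calibrated probabilities.
calibrated = entropy_temperature_scaling(np.random.randn(4, 10), a=0.5, b=0.0)
```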