Abstract

Overconfidence in deep neural networks (DNNs) reduces a model's generalization performance and increases its risk. The deep ensemble method improves robustness and generalization by combining the predictions of multiple DNNs. However, training multiple DNNs for model averaging is a time-consuming and resource-intensive process. Moreover, combining multiple base learners (also called inducers) is difficult to do well, and a poor choice may yield lower prediction accuracy than a single inducer. We propose an approximation method for deep ensembles that obtains an ensemble of multiple DNNs without any additional training cost. Specifically, multiple locally optimal parameter sets generated during the training phase are sampled and saved using an intelligent strategy: we apply cyclic learning rates starting at 75% of the training process and save the weights associated with the minimum learning rate of each cycle. The saved parameter sets are then used as the weights of a new model to perform forward propagation during the testing phase. Experiments on benchmarks of two different modalities, static images and dynamic videos, show that our method not only reduces the model's calibration error but also improves its accuracy.
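The procedure described above (cyclic learning rates over the final 25% of training, a weight snapshot taken at the minimum learning rate of each cycle, and ensembled forward passes at test time) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the use of PyTorch, the cosine shape of the cycle, and names such as cyclic_lr, train_with_snapshots, and ensemble_predict are assumptions made for illustration only.

# Minimal sketch of a snapshot-style deep-ensemble approximation (assumed details, not the paper's code).
import copy
import math
import torch
import torch.nn as nn

def cyclic_lr(step, steps_per_cycle, lr_max):
    # Cosine-annealed learning rate within one cycle (one common choice; assumed here).
    t = (step % steps_per_cycle) / steps_per_cycle
    return 0.5 * lr_max * (1.0 + math.cos(math.pi * t))

def train_with_snapshots(model, loader, total_steps, lr_max=0.1, num_cycles=5):
    # Train normally for the first 75% of steps, then switch to cyclic learning
    # rates and save a weight snapshot at the end (minimum LR) of each cycle.
    optimizer = torch.optim.SGD(model.parameters(), lr=lr_max)
    criterion = nn.CrossEntropyLoss()
    cycle_start = int(0.75 * total_steps)          # cycling begins at 75% of training
    steps_per_cycle = max(1, (total_steps - cycle_start) // num_cycles)
    snapshots = []
    step = 0
    while step < total_steps:
        for x, y in loader:
            if step >= total_steps:
                break
            if step < cycle_start:
                lr = lr_max                        # placeholder for the ordinary schedule
            else:
                lr = cyclic_lr(step - cycle_start, steps_per_cycle, lr_max)
            for group in optimizer.param_groups:
                group["lr"] = lr
            optimizer.zero_grad()
            loss = criterion(model(x), y)
            loss.backward()
            optimizer.step()
            # Snapshot when the learning rate reaches its minimum, i.e. at the end of a cycle.
            if step >= cycle_start and (step - cycle_start + 1) % steps_per_cycle == 0:
                snapshots.append(copy.deepcopy(model.state_dict()))
            step += 1
    return snapshots

@torch.no_grad()
def ensemble_predict(model, snapshots, x):
    # Load each saved parameter set into the same architecture and average the softmax outputs.
    probs = []
    for state in snapshots:
        model.load_state_dict(state)
        model.eval()
        probs.append(torch.softmax(model(x), dim=-1))
    return torch.stack(probs).mean(dim=0)

At test time, ensemble_predict reuses a single model instance and only swaps in the saved weights, which is what makes the ensemble essentially free compared with training several independent networks.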
