Abstract

Bayesian neural networks harness Bayesian inference to provide an approach to neural learning that focuses not only on accuracy but also on uncertainty quantification. Markov Chain Monte Carlo (MCMC) methods implement Bayesian inference by sampling from the posterior distribution of the model parameters. In the case of Bayesian neural networks, the model parameters refer to weights and biases. MCMC methods suffer from scalability issues in large models, such as deep neural networks with thousands to millions of parameters. In this paper, we present a Bayesian ensemble learning framework that utilizes gradient boosting by combining multiple shallow neural networks (base learners) trained via MCMC sampling. We present two Bayesian gradient boosting strategies that employ simple neural networks as base learners with Langevin MCMC sampling. We evaluate the performance of these methods on various classification and time-series prediction problems. We demonstrate that the proposed framework improves the prediction accuracy of canonical gradient boosting while providing uncertainty quantification via Bayesian inference. Furthermore, we demonstrate that the respective methods scale well as the dataset and model sizes increase.
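To make the idea concrete, the following is a minimal sketch, not the authors' implementation, of the approach the abstract describes: gradient boosting on squared loss where each base learner is a shallow neural network whose weights are sampled with (unadjusted) Langevin dynamics, yielding both a mean prediction and an uncertainty estimate. The class name `ShallowNetLangevin`, the step size, prior and noise variances, number of boosting rounds, and the independence assumption used when combining learner variances are all illustrative assumptions, not details taken from the paper.

```python
import numpy as np

class ShallowNetLangevin:
    """One-hidden-layer regression network whose weights are sampled with
    unadjusted Langevin dynamics, giving a posterior-predictive ensemble.
    All hyperparameter defaults are assumptions for illustration only."""

    def __init__(self, n_hidden=10, step=1e-3, n_samples=200, burn_in=100,
                 prior_var=1.0, noise_var=0.1, rng=None):
        self.n_hidden, self.step = n_hidden, step
        self.n_samples, self.burn_in = n_samples, burn_in
        self.prior_var, self.noise_var = prior_var, noise_var
        self.rng = rng or np.random.default_rng(0)
        self.samples = []  # retained posterior samples of (W1, b1, W2, b2)

    def _init_params(self, d):
        r = self.rng
        return [r.normal(0, 0.1, (d, self.n_hidden)),   # W1
                np.zeros(self.n_hidden),                 # b1
                r.normal(0, 0.1, (self.n_hidden, 1)),    # W2
                np.zeros(1)]                             # b2

    def _forward(self, X, p):
        W1, b1, W2, b2 = p
        h = np.tanh(X @ W1 + b1)
        return h, (h @ W2 + b2).ravel()

    def _grad_log_post(self, X, y, p):
        """Gradient of the log posterior: Gaussian likelihood + Gaussian prior."""
        W1, b1, W2, b2 = p
        h, pred = self._forward(X, p)
        err = (y - pred) / self.noise_var                # d(log-lik)/d(pred)
        gW2 = h.T @ err[:, None] - W2 / self.prior_var
        gb2 = err.sum(keepdims=True) - b2 / self.prior_var
        dh = err[:, None] @ W2.T * (1 - h ** 2)          # backprop through tanh
        gW1 = X.T @ dh - W1 / self.prior_var
        gb1 = dh.sum(axis=0) - b1 / self.prior_var
        return [gW1, gb1, gW2, gb2]

    def fit(self, X, y):
        p = self._init_params(X.shape[1])
        for t in range(self.burn_in + self.n_samples):
            grads = self._grad_log_post(X, y, p)
            # Langevin update: gradient step plus Gaussian exploration noise
            p = [w + 0.5 * self.step * g
                   + np.sqrt(self.step) * self.rng.normal(size=w.shape)
                 for w, g in zip(p, grads)]
            if t >= self.burn_in:
                self.samples.append([w.copy() for w in p])
        return self

    def predict(self, X):
        preds = np.stack([self._forward(X, p)[1] for p in self.samples])
        return preds.mean(axis=0), preds.std(axis=0)     # mean and uncertainty


def boosted_fit(X, y, n_rounds=5, shrinkage=0.3):
    """Canonical squared-loss gradient boosting: each round fits a
    Langevin-sampled shallow net to the current residuals."""
    learners, residual = [], y.astype(float).copy()
    for _ in range(n_rounds):
        m = ShallowNetLangevin().fit(X, residual)
        mean, _ = m.predict(X)
        residual -= shrinkage * mean
        learners.append(m)
    return learners


def boosted_predict(learners, X, shrinkage=0.3):
    means, variances = 0.0, 0.0
    for m in learners:
        mu, sd = m.predict(X)
        means += shrinkage * mu
        variances += (shrinkage * sd) ** 2   # assumes learners are independent
    return means, np.sqrt(variances)


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    X = rng.uniform(-3, 3, (200, 1))
    y = np.sin(X).ravel() + rng.normal(0, 0.1, 200)
    ensemble = boosted_fit(X, y)
    mean, std = boosted_predict(ensemble, X)
    print("train RMSE:", np.sqrt(np.mean((mean - y) ** 2)))
```

The boosting loop mirrors standard residual fitting with shrinkage; the only Bayesian ingredient here is that each base learner's prediction is averaged over Langevin posterior samples, which is one simple way to obtain the uncertainty estimates the abstract refers to.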
