Kalman Optimizer for Consistent Gradient Descent

Xingyi Yang

doi:10.1109/icassp39728.2021.9414588

Abstract

Deep neural networks (DNNs) are typically optimized using stochastic gradient descent (SGD). However, gradient estimates computed from stochastic samples tend to be noisy and unreliable, resulting in large gradient variance and poor convergence. In this paper, we propose the Kalman Optimizer (KO), an efficient stochastic optimization algorithm that adopts the Kalman filter to produce a consistent estimate of the local gradient by solving an adaptive filtering problem. Our method reduces estimation variance in stochastic gradient descent by incorporating the historical state of the optimization. It aims to correct noisy gradient directions and to accelerate the convergence of learning. We demonstrate the effectiveness of the proposed Kalman Optimizer on various optimization tasks, where it is shown to achieve superior and robust performance. The code is available at https://github.com/Adamdad/Filter-Gradient-Decent.
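
The abstract does not give the filter equations, so the following is only a minimal sketch of the general idea, not the paper's algorithm. It assumes a per-coordinate scalar Kalman filter with a random-walk model for the true gradient; the class name `KalmanGradientFilter`, the parameters `process_var` and `obs_var`, and the toy quadratic objective are all hypothetical choices made for illustration.

```python
import random


class KalmanGradientFilter:
    """Per-coordinate scalar Kalman filter (illustrative assumption, not the paper's KO)."""

    def __init__(self, dim, process_var=1e-2, obs_var=1.0):
        self.q = process_var       # assumed drift variance of the true gradient per step
        self.r = obs_var           # assumed variance of the mini-batch gradient noise
        self.g_hat = [0.0] * dim   # filtered gradient estimate
        self.p = [1.0] * dim       # variance of the estimate

    def update(self, noisy_grad):
        for i, z in enumerate(noisy_grad):
            p_pred = self.p[i] + self.q               # predict: random-walk state model
            k = p_pred / (p_pred + self.r)            # Kalman gain, always in (0, 1)
            self.g_hat[i] += k * (z - self.g_hat[i])  # correct with the new observation
            self.p[i] = (1.0 - k) * p_pred            # posterior variance
        return list(self.g_hat)


def noisy_quadratic_grad(x, rng, noise_std=1.0):
    # The gradient of f(x) = 0.5 * ||x||^2 is x; Gaussian noise mimics mini-batch sampling.
    return [xi + rng.gauss(0.0, noise_std) for xi in x]


def run(use_filter, steps=500, lr=0.1, seed=0):
    rng = random.Random(seed)
    x = [5.0, -3.0]
    filt = KalmanGradientFilter(dim=len(x))
    for _ in range(steps):
        g = noisy_quadratic_grad(x, rng)
        if use_filter:
            g = filt.update(g)  # descend along the filtered gradient instead of the raw one
        x = [xi - lr * gi for xi, gi in zip(x, g)]
    return x
```

Calling `run(use_filter=True)` and `run(use_filter=False)` with the same seed gives a rough side-by-side of filtered and plain SGD on this toy problem. In this sketch the ratio of `process_var` to `obs_var` sets how strongly history is weighted: a smaller ratio smooths more but reacts more slowly when the true gradient changes.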

Similar Papers

Efficient mini-batch stochastic gradient descent with Centroidal Voronoi Tessellation for PDE-constrained optimization under uncertainty
Liuhong Chen ... Xiaoming He
Physica D: Nonlinear Phenomena | VOL. 467
24 May 2024

A modified Adam algorithm for deep neural network optimization
Mohamed Reyad ... Amany M Sarhan
Neural Computing and Applications | VOL. 35
25 Apr 2023

Efficient Stochastic Optimization Algorithms with Specific Bioinformatics Applications
Aysegul Bumin
ACM SIGBioinformatics Record | VOL. 12
01 Jul 2023

Improving Deep Neural Networks' Training for Image Classification With Nonlinear Conjugate Gradient-Style Adaptive Momentum.
Bao Wang ... Qiang Ye
IEEE transactions on neural networks and learning systems | VOL. 35
01 Sep 2024
