Abstract

This article addresses nonconvex stochastic optimization problems in deep learning. It provides theoretically grounded learning rates for adaptive-learning-rate optimization algorithms (e.g., Adam and AMSGrad) to approximate the stationary points of such problems, and it shows that these rates yield faster convergence than previously reported for these algorithms. In numerical experiments on text and image classification, the algorithms perform better with constant learning rates than with diminishing learning rates.
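To make the constant-versus-diminishing distinction concrete, the following is a minimal NumPy sketch of the standard Adam update applied with a fixed learning rate. The toy objective, step count, and all hyperparameter values (`alpha`, `beta1`, `beta2`, `eps`) are illustrative assumptions, not the paper's experimental setup.

```python
import numpy as np

def adam_step(theta, grad, m, v, t, alpha=0.01, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update with a constant learning rate alpha.

    A diminishing-rate variant would instead pass, e.g., alpha / sqrt(t).
    """
    m = beta1 * m + (1 - beta1) * grad        # first-moment (mean) estimate
    v = beta2 * v + (1 - beta2) * grad ** 2   # second-moment (uncentered variance) estimate
    m_hat = m / (1 - beta1 ** t)              # bias correction for m
    v_hat = v / (1 - beta2 ** t)              # bias correction for v
    theta = theta - alpha * m_hat / (np.sqrt(v_hat) + eps)
    return theta, m, v

# Toy problem: minimize f(theta) = (theta - 3)^2 with a constant learning rate.
theta = np.array([0.0])
m = np.zeros_like(theta)
v = np.zeros_like(theta)
for t in range(1, 5001):
    grad = 2.0 * (theta - 3.0)
    theta, m, v = adam_step(theta, grad, m, v, t)

print(theta)  # near the minimizer 3.0
```

With a constant `alpha`, the iterates settle into a small neighborhood of the stationary point whose size scales with the learning rate; the article's contribution is a theoretical justification for choosing such constant rates for Adam-type methods on nonconvex problems.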
