Abstract
The learning rate is arguably the most important hyperparameter to tune when training a neural network. Since manually setting the right learning rate remains a cumbersome process, adaptive learning rate algorithms aim to automate it. Motivated by the success of the Barzilai–Borwein (BB) step-size method in many gradient descent methods for solving convex problems, this paper investigates the potential of the BB method for training neural networks. Supported by related convergence analysis, the BB method is generalized to provide an adaptive learning rate for mini-batch gradient descent. Experiments show that, in contrast to many existing methods, the proposed BB method is highly insensitive to the initial learning rate, especially in terms of generalization performance. The BB method also demonstrates advantages over other available methods in both learning speed and generalization performance.
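For context, the classical BB step sizes for full-batch gradient descent on an objective f (the standard formulation the abstract builds on; the paper's specific mini-batch generalization is not detailed here) are commonly written as:

\[
s_{k-1} = x_k - x_{k-1}, \qquad y_{k-1} = \nabla f(x_k) - \nabla f(x_{k-1}),
\]
\[
\alpha_k^{\mathrm{BB1}} = \frac{s_{k-1}^{\top} s_{k-1}}{s_{k-1}^{\top} y_{k-1}}, \qquad
\alpha_k^{\mathrm{BB2}} = \frac{s_{k-1}^{\top} y_{k-1}}{y_{k-1}^{\top} y_{k-1}}.
\]

These step sizes approximate curvature from successive iterates and gradients, which is what makes a BB-style rule a natural candidate for an adaptive learning rate.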