Second-order Derivative Optimization Methods in Deep Learning Neural Networks

Si Yong Lim, King Hann Lim

doi:10.1109/gecost55694.2022.10010399

Abstract

Optimization techniques in deep learning neural networks are crucial for improving parameter learning capability. Owing to recent advances in technology and computational power, second-order derivative optimization methods have become increasingly popular because they provide additional information about the curvature of the objective function. In addition, multiple strategies can be applied when training a deep neural network, based on the Hessian computation and on a step-length controlling parameter derived from the optimization trajectory. This paper presents the latest second-order derivative optimization methods and their corresponding weight update rules in chronological order. The reviewed methods comprise Newton’s method, conjugate gradient, quasi-Newton, Gauss-Newton, Hessian-free, Kronecker-factored approximate curvature, stochastic diagonal approximate greatest descent, AdaHessian, complex-step directional derivative, and Shampoo algorithms. A comparative study of these second-order derivative methods highlights their advantages, limitations, and performance. An overall conclusion and directions for future work are given to provide insight into the future development of optimization techniques.
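To illustrate what curvature information adds, the following is a minimal sketch of a single Newton's method update next to a plain gradient step on a small quadratic objective. It is not taken from the paper: the matrix `A`, the vector `b`, the learning rate, and all function names are assumptions chosen for illustration. The Newton update used here is the textbook rule w_new = w - H^{-1} g, where g is the gradient and H is the Hessian.

```python
# Illustrative objective (an assumption, not from the paper):
#   f(w) = 0.5 * w^T A w - b^T w, with gradient A w - b and Hessian A.
A = [[3.0, 1.0],
     [1.0, 2.0]]
b = [1.0, 1.0]


def gradient(w, A, b):
    """Gradient of the quadratic objective: A w - b."""
    return [A[0][0] * w[0] + A[0][1] * w[1] - b[0],
            A[1][0] * w[0] + A[1][1] * w[1] - b[1]]


def solve_2x2(H, g):
    """Solve H d = g for a 2x2 matrix H using Cramer's rule."""
    det = H[0][0] * H[1][1] - H[0][1] * H[1][0]
    return [(H[1][1] * g[0] - H[0][1] * g[1]) / det,
            (H[0][0] * g[1] - H[1][0] * g[0]) / det]


def gradient_step(w, A, b, lr):
    """First-order update: move against the gradient with a fixed step length."""
    g = gradient(w, A, b)
    return [w[0] - lr * g[0], w[1] - lr * g[1]]


def newton_step(w, A, b):
    """Second-order update: rescale the gradient by the inverse Hessian."""
    g = gradient(w, A, b)
    d = solve_2x2(A, g)  # the Hessian of this objective is A itself
    return [w[0] - d[0], w[1] - d[1]]


w0 = [0.0, 0.0]
w_gd = gradient_step(w0, A, b, lr=0.1)
w_newton = newton_step(w0, A, b)

print("gradient step:", w_gd, "remaining gradient:", gradient(w_gd, A, b))
print("Newton step:  ", w_newton, "remaining gradient:", gradient(w_newton, A, b))
```

On a quadratic objective the Newton step lands on the minimizer in one update, while the gradient step still has a nonzero gradient. In a deep network the Hessian is far too large to form and invert in this way, which is the motivation for the approximate-curvature methods listed in the abstract.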
