Incorporating Derivative-Free Convexity with Trigonometric Simplex Designs for Learning-Rate Estimation of Stochastic Gradient-Descent Method

Emre Tokgoz,Hassan Musafer,Ausif Mahmood,Miad Faezipour

doi:10.3390/electronics12020419

Emre Tokgoz, Hassan Musafer + Show 2 more

Open Access

https://doi.org/10.3390/electronics12020419

Copy DOI

Abstract

This paper proposes a novel mathematical theory of adaptation to convexity of loss functions based on the definition of the condense-discrete convexity (CDC) method. The developed theory is considered to be of immense value to stochastic settings and is used for developing the well-known stochastic gradient-descent (SGD) method. The successful contribution of change of the convexity definition impacts the exploration of the learning-rate scheduler used in the SGD method and therefore impacts the convergence rate of the solution that is used for measuring the effectiveness of deep networks. In our development of methodology, the convexity method CDC and learning rate are directly related to each other through the difference operator. In addition, we have incorporated the developed theory of adaptation with trigonometric simplex (TS) designs to explore different learning rate schedules for the weight and bias parameters within the network. Experiments confirm that by using the new definition of convexity to explore learning rate schedules, the optimization is more effective in practice and has a strong effect on the training of the deep neural network.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Electronics	Publication Date: Jan 13, 2023
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Incorporating Derivative-Free Convexity with Trigonometric Simplex Designs for Learning-Rate Estimation of Stochastic Gradient-Descent Method

Abstract

Talk to us

Similar Papers

More From: Electronics

Lead the way for us

Similar Papers

Averaged Stochastic Optimization for Medical Image Registration Based on Variance Reduction
Wei Sun ... Dirk H J Poot
-
Wei Sun, et. al.Wei Sun ... Dirk H J Poot
01 Jan 2018
01 Jan 2018

Bi-fidelity stochastic gradient descent for structural optimization under uncertainty
Subhayan De ... Kurt Maute
Computational Mechanics | VOL. 66
Subhayan De, et. al.Subhayan De ... Kurt Maute
03 Aug 2020
Computational Mechanics | VOL. 66

Momentum and stochastic momentum for stochastic gradient, Newton, proximal point and subspace descent methods
Nicolas Loizou ... Peter Richtárik
Computational Optimization and Applications | VOL. 77
Nicolas Loizou, et. al.Nicolas Loizou ... Peter Richtárik
23 Sep 2020
Computational Optimization and Applications | VOL. 77

Kalman-Based Stochastic Gradient Method with Stop Condition and Insensitivity to Conditioning
Vivak Patel
SIAM Journal on Optimization | VOL. 26
Vivak PatelVivak Patel
01 Jan 2015
SIAM Journal on Optimization | VOL. 26

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Incorporating Derivative-Free Convexity with Trigonometric Simplex Designs for Learning-Rate Estimation of Stochastic Gradient-Descent Method

Abstract

Talk to us

Similar Papers

More From: Electronics