An Adaptive Optimization Method Based on Learning Rate Schedule for Neural Networks

Dokkyun Yi, Jieun Park, Sangmin Ji

doi:10.3390/app11020850

Open Access

https://doi.org/10.3390/app11020850

Journal: Applied Sciences
Publication Date: Jan 18, 2021
Citations: 2
License type: CC BY 4.0

Affiliation: Daegu University, Chungnam National University

Abstract

Artificial intelligence (AI) is achieved by optimizing a cost function constructed from learning data. Changing the parameters of the cost function is the AI learning process (AI learning, for convenience). If AI learning is performed well, the value of the cost function reaches its global minimum. For the learning to be complete, the parameters should stop changing once the cost function is at the global minimum. One useful optimization method is the momentum method; however, the momentum method has difficulty stopping the parameters when the cost function reaches the global minimum (the non-stop problem). The proposed method is based on the momentum method. To solve the non-stop problem, we incorporate the value of the cost function into our method. As learning proceeds, this mechanism reduces the amount of change in the parameters according to the value of the cost function. We verified the method through a proof of convergence and through numerical experiments comparing it with existing methods, to ensure that the learning works well.

Highlights

Artificial intelligence (AI) is completed by defining the cost function constructed through an artificial neural network (ANN) from given learning data, and by determining the parameters that minimize this cost function
If AI learning continues with a method based only on the first-order derivative of the cost function, learning does not proceed once a local minimum is reached
To complete AI learning, we introduce a method that adds the first-order derivative of the cost function to the cost function so that learning proceeds toward the global minimum

Summary

Introduction

Artificial intelligence (AI) is completed by defining the cost function constructed through an artificial neural network (ANN) from given learning data, and by determining the parameters that minimize this cost function. For the first problem, the definition of the cost function, the more data there is and the more complicated the structure of the ANN, the more complex the cost function becomes [1,2,3,4,5]. The main purpose of this paper is to solve the second problem: completing AI learning, using the first derivative of the cost function, when the cost function contains many local minimums. The proposed method is based on the momentum method and adds an adaptive property: the step size changes with the magnitude of the cost function. It retains the strength of the momentum method, adds adaptive properties to achieve a certain percentage of learning, and sets the step size according to the value of the cost function, bringing the parameters as close as possible to the minimum value of the cost function.
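
The non-stop problem and the idea of tying the step size to the cost value can be illustrated with a small sketch. This is not the authors' update rule, which this summary does not state. It assumes a one-dimensional cost C(w) = w^2 whose global minimum value is 0, and it uses a hypothetical scaling factor C / (1 + C) chosen only because it goes to zero as the cost goes to zero. The function names `momentum_step` and `cost_scaled_step` are likewise illustrative.

```python
# Illustrative sketch only: the cost function, the scaling factor, and the
# function names are assumptions, not the update rule from the paper.

def cost(w):
    """Toy cost with global minimum value 0 at w = 0 (assumed for illustration)."""
    return w * w


def grad(w):
    """First derivative of the toy cost."""
    return 2.0 * w


def momentum_step(w, v, lr=0.1, beta=0.9):
    """One step of the classical momentum method."""
    v_new = beta * v + grad(w)
    return w - lr * v_new, v_new


def cost_scaled_step(w, v, lr=0.1, beta=0.9):
    """Momentum step whose size shrinks with the cost value.

    The factor c / (1 + c) is a hypothetical choice: it lies in [0, 1)
    and vanishes when the cost reaches 0.
    """
    v_new = beta * v + grad(w)
    c = cost(w)
    scale = c / (1.0 + c)
    return w - lr * scale * v_new, v_new


if __name__ == "__main__":
    # Start exactly at the global minimum, but with leftover velocity.
    w_m, _ = momentum_step(0.0, 1.0)
    w_c, _ = cost_scaled_step(0.0, 1.0)
    print("plain momentum moves to:", w_m)
    print("cost-scaled step stays at:", w_c)
```

Starting at the minimum with leftover velocity, the plain momentum step moves the parameter away from w = 0 even though the gradient there is zero, while the cost-scaled step leaves it in place because the scaling factor is zero when the cost is zero.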

The Optimization Problem and the Momentum Method

Our Proposed Method

Numerical Tests

Three-Dimensional Surface with One Local Minimum

Three-Dimensional Surface with Three Local Minimums

CIFAR-10

Findings

Conclusions
