Abstract

Artificial intelligence (AI) is achieved by optimizing the cost function constructed from learning data. Changing the parameters in the cost function is an AI learning process (or AI learning for convenience). If AI learning is well performed, then the value of the cost function is the global minimum. In order to obtain the well-learned AI learning, the parameter should be no change in the value of the cost function at the global minimum. One useful optimization method is the momentum method; however, the momentum method has difficulty stopping the parameter when the value of the cost function satisfies the global minimum (non-stop problem). The proposed method is based on the momentum method. In order to solve the non-stop problem of the momentum method, we use the value of the cost function to our method. Therefore, as the learning method processes, the mechanism in our method reduces the amount of change in the parameter by the effect of the value of the cost function. We verified the method through proof of convergence and numerical experiments with existing methods to ensure that the learning works well.

Highlights

  • Artificial intelligence (AI) is completed by defining the cost function constructed through an artificial neural network (ANN) from given learning data, and by determining the parameters that minimize this cost function

  • If AI learning is continued as a learning method based on the first order derivative of the cost function, the learning is not performed at this local minimum

  • In order to complete AI learning, we introduce the method of adding the first order derivative of the cost function to the cost function so that the learning is carried out using the global minimum

Read more

Summary

Introduction

Artificial intelligence (AI) is completed by defining the cost function constructed through an artificial neural network (ANN) from given learning data, and by determining the parameters that minimize this cost function. The first problem, the definition of the cost function, is the more data and the more complicated the structure of the ANN For this reason, the cost function is increased the complexity [1,2,3,4,5]. The main purpose of this paper is to solve the second problem, that is, we want to complete AI learning based on the first derivative of the cost function in a cost function that contains many local minimums. This paper is based on the momentum method and adds an adaptive property, which constitutes a step size change with the degree of the cost function It maintains the power of the momentum method, adds adaptive properties to it to make a certain percentage of learning, and constructs a step size according to the amount of the cost function, making it as close as possible to the minimum value of the cost function.

The Optimization Problem and the Momentum Method
Our Proposed Method
Numerical Tests
Three-Dimensional Surface with One Local Minimum
Three-Dimensional Surface with Three Local Minimums
CIFAR-10
Findings
Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.