Abstract

Engineering data are often highly nonlinear and contain high-frequency noise, so a neural network trained with the Levenberg–Marquardt (LM) algorithm may fail to converge on such data. In this work, we analyzed the causes of the poor convergence commonly associated with LM neural networks. Specifically, we evaluated how different activation functions, namely Sigmoid, Tanh, Rectified Linear Unit (ReLU), and Parametric Rectified Linear Unit (PReLU), affect the overall performance of LM neural networks, and we identified particular parameter states of the network that can make the LM algorithm converge poorly. To address these problems, we proposed an adaptive LM (AdaLM) algorithm. The algorithm coordinates the descent direction and the descent step size through the iteration number, which prevents the optimization from falling into bad local minima and removes the dependence on the parameter state of the LM neural network. We compared the AdaLM algorithm with the traditional LM algorithm and its variants in terms of accuracy and speed on common benchmark datasets and aero-engine data, and the results verify the effectiveness of the AdaLM algorithm.
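For background, the sketch below shows the classic damped LM update on a least-squares cost in NumPy. It is a minimal illustration, not the paper's implementation; in particular, `damping_schedule` is only a hypothetical example of tying the step to the iteration number and is not the AdaLM rule proposed in the paper.

```python
import numpy as np

def lm_step(w, J, r, mu):
    """One classic Levenberg-Marquardt update for a least-squares cost.

    w  : current weight vector, shape (n,)
    J  : Jacobian of the network output with respect to w, shape (m, n)
    r  : residual vector y_label - f(w), shape (m,)
    mu : damping factor; a large mu gives a small, gradient-descent-like step,
         a small mu gives a large, Gauss-Newton-like step
    """
    A = J.T @ J + mu * np.eye(w.size)   # damped approximate Hessian
    g = J.T @ r                         # half the negative gradient of the cost
    return w + np.linalg.solve(A, g)    # w_new = w + (J^T J + mu I)^-1 J^T r

# Hypothetical illustration only (not the paper's AdaLM rule): one way a
# damping schedule could be driven by the iteration number k, so that the
# descent direction and step size change together as training proceeds.
def damping_schedule(mu0, k, decay=0.9):
    return mu0 * decay ** k
```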

Highlights

  • When applied to real-world data interspersed with high nonlinearity and high-frequency noise, LM neural networks have irreplaceable advantages

  • For researchers who still wish to use LM neural networks, this work points to appropriate strategies, such as preprocessing the training data and intervening on the weights at the start of training, to avoid these problems

  • This study proposed a new solution to the problem of LM neural networks: the adaptive LM (AdaLM) algorithm

Summary

Introduction

When applied to real-world data interspersed with high nonlinearity and high-frequency noise, LM neural networks have irreplaceable advantages. The activation functions in neural networks are not necessarily continuous and differentiable, so the results in the references above hold only in their own settings and do not prove that global optimization can be achieved in a neural network model. By analyzing the output behavior of several activation functions, we explain in detail the specific factors that cause the original LM algorithm to drive the cost function into bad local minima. In view of these factors, the proposed algorithm compensates for the deficiencies of the LM algorithm and trains a network efficiently. Given a neural network model f(w), the cost function is the least-squares problem F(w) = Σ(y_label − f(w))².
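A minimal NumPy sketch of this cost function and of the activation functions compared in the paper is given below; the function names and the PReLU slope value are illustrative assumptions, not taken from the paper's code.

```python
import numpy as np

# Minimal sketch of the least-squares cost F(w) = sum_i (y_label_i - f_i(w))^2;
# the residual is formed from the labels and the network predictions f(w).
def cost(y_label, y_pred):
    r = y_label - y_pred
    return np.sum(r ** 2)

# The activation functions compared in the paper. ReLU and PReLU are not
# differentiable at 0, the kind of non-smoothness the text refers to.
def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def tanh(x):
    return np.tanh(x)

def relu(x):
    return np.maximum(0.0, x)

def prelu(x, a=0.25):  # 'a' is the learnable slope; 0.25 is only a placeholder
    return np.where(x > 0, x, a * x)
```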

Problem of LM Algorithm
Analysis of Falling into “Bad” Local Minima
The Proposed Algorithm
The Influence of Each Algorithm on Activation Function
Conclusions