Formal convergence analysis on deterministic [formula omitted]-regularization based mini-batch learning for RBF networks

Zhaofeng Liu,Chi-Sing Leung,Hing Cheung So

doi:10.1016/j.neucom.2023.02.012

Abstract

Conventional convergence analysis on mini-batch learning is usually based on the stochastic gradient concept, in which we assume that the training data are presented in a random order. Also, some convergence results require that the learning rate should decrease with the number of training cycles, and that the objective function is a smooth function. Practically speaking, a deterministic presentation scheme with a fixed learning rate is more preferable. Hence, there is a gap between theoretical results and actual implementation. This paper aims at filling the gap. We use the radial basis function (RBF) model for nonlinear regression problems as an example to analyze the convergence properties of mini-batch learning. This paper considers a nonsmooth objective function, which consists of three terms. The coexistence of these three terms is able to handle a number of situations. The first term is a conventional training set error. The second term is a quadratic term which is used to suppress the effect of imperfections in the implementation. The last term is an ℓ1-norm term which is used to select important RBF nodes for the resultant network. Note that the ℓ1-norm term is a nonsmooth function. Although a nonsmooth ℓ1-norm is included and the mini-batch algorithm is deterministic, we are still able to derive the convergence properties, including the sufficient conditions for convergence and range of learning rate. With our results, we have a better theoretical understanding on the behaviour of mini-batch learning and obtain some guidelines on choosing the learning rate. The analysis results can be extended to other flat structural neural network models and other objective functions, which are with quadratic terms and ℓ1-norm.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Formal convergence analysis on deterministic [formula omitted]-regularization based mini-batch learning for RBF networks

Abstract

Talk to us

Similar Papers

More From: Neurocomputing

Lead the way for us

Journal: Neurocomputing	Publication Date: Feb 16, 2023
Citations: 3

Similar Papers

A learning algorithm with a gradient normalization and a learning rate adaptation for the mini-batch type learning
Daiki Ito ... Seiichi Koakutsu
-
Daiki Ito, et. al.Daiki Ito ... Seiichi Koakutsu
01 Sep 2017
01 Sep 2017

Optimization methods for non-smooth or noisy objective functions in fluid design problems
R Narducci ... R Haftka
-
R Narducci, et. al.R Narducci ... R Haftka
19 Jun 1995
19 Jun 1995

RBF model of microwave filter using PDGS with defected rectangles
Taobin Jin ... Jie Jin
-
Taobin Jin, et. al.Taobin Jin ... Jie Jin
01 Aug 2010
01 Aug 2010

Development of Adaptive and Factorized Neural Models for MPC of Industrial Systems

-

11 Jan 2017
11 Jan 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Formal convergence analysis on deterministic [formula omitted]-regularization based mini-batch learning for RBF networks

Abstract

Talk to us

Similar Papers

More From: Neurocomputing