Abstract

Deep fully connected neural networks (FCNNs) are the workhorses of deep learning and are broadly applicable due to their “agnostic” structure. Generally, the learning capability of an FCNN improves as the number of layers and the width of each layer increase, but this improvement comes at a higher computational cost in training. To alleviate this difficulty, we develop in this paper a gradually reinforced differentiable homotopy (GRDH) method to train FCNNs. Specifically, by introducing an extra variable t ranging between zero and one, we design a series of auxiliary functions that are continuous and monotonically increasing in t. With these functions, we formulate an artificial optimization problem for training an FCNN that progressively incorporates more layers and nodes into the neural network as t changes from one to zero and eventually becomes an FCNN with the target number of layers and width. We prove that the solution set of the artificial problem contains an everywhere differentiable path, which starts from a uniquely determined point at t=1 and ends at the weights and biases of the target FCNN as t goes to zero. The proposed GRDH method is novel in that it incorporates differentiable homotopy methods into the training of deep learning models while retaining the satisfactory theoretical convergence properties of classical homotopy methods. To demonstrate its practical applicability, we apply both the GRDH method and another efficient method, HTA, to train the same FCNNs and find that GRDH outperforms HTA in both computational time and the number of iterations required to obtain a solution of similar (or even higher) accuracy. Numerical results further confirm the effectiveness of the GRDH method in solving classification problems.
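To make the continuation idea concrete, the following minimal sketch illustrates the general scheme the abstract describes: an auxiliary problem with a known, unique solution at t=1 is gradually deformed into the target FCNN training loss at t=0, and the solution is tracked by warm-started descent as t decreases. The linear blend H(w, t) = t·G(w) + (1−t)·F(w), the toy 2-2-1 network, the finite-difference gradients, and the step schedule are illustrative assumptions only, not the paper's actual GRDH construction, which instead reinforces the network by gradually bringing in additional layers and nodes.

```python
# Minimal sketch of homotopy-continuation training (illustrative assumptions;
# not the paper's GRDH formulation).
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(64, 2))                                        # toy inputs
y = np.tanh(X @ np.array([1.0, -2.0])) + 0.1 * rng.normal(size=64)  # toy targets

def unpack(w):
    """Split the flat parameter vector into a 2-2-1 network's weights/biases."""
    W1 = w[0:4].reshape(2, 2); b1 = w[4:6]
    W2 = w[6:8].reshape(2, 1); b2 = w[8:9]
    return W1, b1, W2, b2

def target_loss(w):
    """F(w): mean-squared error of the small FCNN on the toy data."""
    W1, b1, W2, b2 = unpack(w)
    pred = np.tanh(X @ W1 + b1) @ W2 + b2
    return np.mean((pred.ravel() - y) ** 2)

w0 = rng.normal(scale=0.1, size=9)                # unique start point at t = 1
aux_loss = lambda w: 0.5 * np.sum((w - w0) ** 2)  # G(w): trivially minimized at w0

def homotopy_loss(w, t):
    """H(w, t) = t*G(w) + (1-t)*F(w); equals G at t=1 and F at t=0."""
    return t * aux_loss(w) + (1.0 - t) * target_loss(w)

def num_grad(f, w, eps=1e-6):
    """Central-difference gradient; adequate for this 9-parameter toy."""
    g = np.zeros_like(w)
    for i in range(w.size):
        e = np.zeros_like(w); e[i] = eps
        g[i] = (f(w + e) - f(w - e)) / (2 * eps)
    return g

w = w0.copy()
for t in np.linspace(1.0, 0.0, 21):               # follow the path as t: 1 -> 0
    for _ in range(200):                          # warm-started inner descent
        w -= 0.05 * num_grad(lambda v: homotopy_loss(v, t), w)
print("final target loss F(w):", target_loss(w))
```

Decreasing t in small steps and warm-starting each subproblem from the previous solution is what lets a continuation method track the solution path from the easy auxiliary problem to the target problem, rather than attacking the target problem from scratch.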
