Stochastic Training of Neural Networks via Successive Convex Approximations.

Simone Scardapane,Paolo Di Lorenzo

doi:10.1109/tnnls.2017.2785361

Abstract

This paper proposes a new family of algorithms for training neural networks (NNs). These are based on recent developments in the field of nonconvex optimization, going under the general name of successive convex approximation techniques. The basic idea is to iteratively replace the original (nonconvex, highly dimensional) learning problem with a sequence of (strongly convex) approximations, which are both accurate and simple to optimize. Different from similar ideas (e.g., quasi-Newton algorithms), the approximations can be constructed using only first-order information of the NN function, in a stochastic fashion, while exploiting the overall structure of the learning problem for a faster convergence. We discuss several use cases, based on different choices for the loss function (e.g., squared loss and cross-entropy loss), and for the regularization of the NN's weights. We experiment on several medium-sized benchmark problems and on a large-scale data set involving simulated physical data. The results show how the algorithm outperforms the state-of-the-art techniques, providing faster convergence to a better minimum. Additionally, we show how the algorithm can be easily parallelized over multiple computational units without hindering its performance. In particular, each computational unit can optimize a tailored surrogate function defined on a randomly assigned subset of the input variables, whose dimension can be selected depending entirely on the available computational power.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Stochastic Training of Neural Networks via Successive Convex Approximations.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Neural Networks and Learning Systems

Lead the way for us

Journal: IEEE Transactions on Neural Networks and Learning Systems	Publication Date: Jan 15, 2018
Citations: 13

Similar Papers

Stock market prediction using an improved training algorithm of neural network
Mustain Billah ... Sajjad Waheed
-
Mustain Billah, et. al.Mustain Billah ... Sajjad Waheed
01 Dec 2016
01 Dec 2016

A global convergence PSO training algorithm of neural networks
Ming Li ... Wei Li
-
Ming Li, et. al.Ming Li ... Wei Li
01 Jul 2010
01 Jul 2010

Rate Maximization for Cell-Free Massive MIMO with Low-Resolution ADCs
Yao Zhang ... Haotong Cao
-
Yao Zhang, et. al.Yao Zhang ... Haotong Cao
01 Aug 2019
01 Aug 2019

Leap-frog is a robust algorithm for training neural networks
Johann E W Holm ... Elizabeth C Botha
Network: Computation in Neural Systems | VOL. 10
Johann E W Holm, et. al.Johann E W Holm ... Elizabeth C Botha
01 Jan 1998
Network: Computation in Neural Systems | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Stochastic Training of Neural Networks via Successive Convex Approximations.

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Neural Networks and Learning Systems