Lifted Proximal Operator Machines

Jia Li, Zhouchen Lin, Cong Fang

doi:10.1609/aaai.v33i01.33014181

Abstract

We propose a new optimization method for training feedforward neural networks. By rewriting the activation function as an equivalent proximal operator, we approximate a feedforward neural network by adding the proximal operators to the objective function as penalties; we therefore call the method the lifted proximal operator machine (LPOM). LPOM is block multiconvex in all layer-wise weights and activations, which allows us to use block coordinate descent to update them. Most notably, we use only the mapping of the activation function itself, rather than its derivative, thus avoiding the vanishing or exploding gradient issues of gradient-based training methods. Our method is therefore applicable to various non-decreasing Lipschitz continuous activation functions, which may be saturating and non-differentiable. LPOM requires no auxiliary variables beyond the layer-wise activations, and so uses roughly the same amount of memory as stochastic gradient descent (SGD). Its parameter tuning is also much simpler. We further prove the convergence of updating the layer-wise weights and activations and point out that the optimization can be parallelized by asynchronous updates. Experiments on the MNIST and CIFAR-10 datasets demonstrate the advantages of LPOM.
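To make the phrase "rewriting the activation function as an equivalent proximal operator" concrete, here is a minimal sketch using ReLU as the example. The abstract does not give the construction, so this relies only on a standard fact: ReLU(x) is the minimizer of 0.5*(y - x)^2 over y >= 0, i.e. the proximal operator of the indicator function of the nonnegative half-line. The function names and the brute-force grid search are illustrative assumptions, not the authors' formulation or code.

```python
import numpy as np


def relu(x):
    """ReLU activation, evaluated directly."""
    return np.maximum(x, 0.0)


def prox_nonneg_bruteforce(x, candidates):
    """Approximate argmin over y >= 0 of 0.5 * (y - x)^2 by grid search.

    `candidates` is a hypothetical grid of feasible (nonnegative) values of y;
    a real solver would use the closed form, which is ReLU itself.
    """
    # costs[i, j] = 0.5 * (candidates[j] - x[i])^2
    costs = 0.5 * (candidates[None, :] - x[:, None]) ** 2
    return candidates[np.argmin(costs, axis=1)]


x = np.array([-2.0, -0.5, 0.0, 0.75, 3.0])
candidates = np.linspace(0.0, 4.0, 1601)  # feasible set y >= 0, step 0.0025

approx = prox_nonneg_bruteforce(x, candidates)
max_gap = float(np.max(np.abs(approx - relu(x))))
print("max |prox - relu| on the test points:", max_gap)
```

The point of the comparison is that the activation can be obtained as the solution of a small optimization problem that uses only the mapping, not its derivative, which is the property the abstract emphasizes.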

Journal: Proceedings of the ... AAAI Conference on Artificial Intelligence
Publication Date: Jul 17, 2019
Citations: 23

Similar Papers

Training Neural Networks by Lifted Proximal Operator Machines
Jia Li ... Mingqing Xiao
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 44
31 Dec 2020

State of Charge Estimation of Lead-Acid Battery with Coulomb Counting and Feed-Forward Neural Network Method
Derry Rifqi Septian Nugraha ... Faiz Husnayain
23 Sep 2020

A fast learning method for feedforward neural networks
Shitong Wang ... Jun Wu
Neurocomputing | VOL. 149
18 Sep 2014

Neural Networks
Bert Kramer
24 Sep 2004
