Loss surface of XOR artificial neural networks

Dhagash Mehta,Xiaojun Zhao,David J Wales,Edgar A Bernal

doi:10.1103/physreve.97.052307

Abstract

Training an artificial neural network involves an optimization process over the landscape defined by the cost (loss) as a function of the network parameters. We explore these landscapes using optimization tools developed for potential energy landscapes in molecular science. The number of local minima and transition states (saddle points of index one), as well as the ratio of transition states to minima, grow rapidly with the number of nodes in the network. There is also a strong dependence on the regularization parameter, with the landscape becoming more convex (fewer minima) as the regularization term increases. We demonstrate that in our formulation, stationary points for networks with N_{h} hidden nodes, including the minimal network required to fit the XOR data, are also stationary points for networks with N_{h}+1 hidden nodes when all the weights involving the additional node are zero. Hence, smaller networks trained on XOR data are embedded in the landscapes of larger networks. Our results clarify certain aspects of the classification and sensitivity (to perturbations in the input data) of minima and saddle points for this system, and may provide insight into dropout and network compression.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Loss surface of XOR artificial neural networks

Abstract

Talk to us

Similar Papers

More From: Physical Review E

Lead the way for us

Journal: Physical Review E	Publication Date: May 21, 2018
Citations: 21

Similar Papers

Optimization of ANN Structure Using Adaptive PSO & GA and Performance Analysis Based on Boolean Identities
Amaresh Sahu ... Sushanta Panigrahi
International Journal of Computer and Communication Technology | VOL. -
Amaresh Sahu, et. al.Amaresh Sahu ... Sushanta Panigrahi
01 Oct 2013
International Journal of Computer and Communication Technology | VOL. -

A Natural Algorithmic Approach to the Structural Optimisation of Neural Networks
N P Suraweera ... D N Ranasinghe
-
N P Suraweera, et. al.N P Suraweera ... D N Ranasinghe
01 Dec 2008
01 Dec 2008

Application of Support Vector Regression for Improving the Performance of the Emotion Prediction Model
...
-
, et. al. ...
01 Jan 2012
01 Jan 2012

Neural networks in the analysis of water-soluble sulfonylurea herbicides using an lc/ms
...
-
, et. al. ...
27 Feb 2018
27 Feb 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Loss surface of XOR artificial neural networks

Abstract

Talk to us

Similar Papers

More From: Physical Review E