Abstract

By using the viewpoint of modern computational algebraic geometry, we explore properties of the optimization landscapes of deep linear neural network models. After clarifying the various definitions of "flat" minima, we show that the geometrically flat minima, which are merely artifacts of residual continuous symmetries of the deep linear networks, can be straightforwardly removed by a generalized L2 regularization. We then use algebraic geometry to establish upper bounds on the number of isolated stationary points of these networks. Combining these upper bounds with methods from numerical algebraic geometry, we find all stationary points for networks of modest depth and matrix size. We demonstrate that, in the presence of non-zero regularization, deep linear networks can indeed possess local minima that are not global minima. Finally, we show that even though the number of stationary points grows as the number of neurons increases (or as the regularization parameter decreases), higher-index saddles are surprisingly rare.
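To make the setup concrete, here is a minimal sketch (not taken from the paper) of how the stationary points of a regularized deep linear network reduce to a polynomial system that can be solved exactly. It assumes a toy depth-2 network with scalar weights w1, w2, a single training pair (x, y) = (1, 1), and an arbitrary regularization strength lam = 1/10; for matrix weights and realistic sizes one would turn to numerical algebraic geometry (homotopy continuation) rather than symbolic solving.

import sympy as sp

# Toy depth-2 "deep linear network" with scalar weights and squared loss
# on the single sample (x, y) = (1, 1), plus a hypothetical L2 term.
w1, w2 = sp.symbols('w1 w2', real=True)
lam = sp.Rational(1, 10)
loss = (w2 * w1 - 1)**2 + lam * (w1**2 + w2**2)

# Stationary points solve the polynomial system dL/dw1 = dL/dw2 = 0.
grad = [sp.diff(loss, w) for w in (w1, w2)]
stationary = sp.solve(grad, [w1, w2], dict=True)

# Classify each real stationary point by the eigenvalues of the Hessian.
H = sp.hessian(loss, (w1, w2))
for pt in stationary:
    if all(v.is_real for v in pt.values()):
        eigs = list(H.subs(pt).eigenvals().keys())
        print(pt, 'Hessian eigenvalues:', eigs)

For this toy problem there are three real stationary points: two symmetric global minima and a saddle at the origin. With lam = 0 the minimizers would instead form the continuous curve w1*w2 = 1 (an artifact of the rescaling symmetry w1 -> c*w1, w2 -> w2/c), illustrating how the regularization removes the flat directions and leaves only isolated stationary points.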
