Stability analysis of stochastic gradient descent for homogeneous neural networks and linear classifiers

Alexandre Lemire Paquin, Brahim Chaib-Draa, Philippe Giguère

doi:10.1016/j.neunet.2023.04.028

Abstract

We prove new generalization bounds for stochastic gradient descent when training classifiers with invariances. Our analysis is based on the stability framework and covers both the convex case of linear classifiers and the non-convex case of homogeneous neural networks. We analyze stability with respect to the normalized version of the loss function used for training. This leads to investigating a form of angle-wise stability instead of Euclidean stability in the weights. For neural networks, the measure of distance we consider is invariant to rescaling the weights of each layer. Furthermore, we exploit the notion of on-average stability to obtain a data-dependent quantity in the bound. In our numerical experiments, this data-dependent quantity is more favorable when training with larger learning rates. This might help shed some light on why larger learning rates can lead to better generalization in some practical scenarios.
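The rescaling invariance mentioned in the abstract can be illustrated with a small sketch. It assumes a bias-free ReLU network and normalizes each layer by its Frobenius norm; the helper names (`forward`, `normalize`, `layerwise_angle_distance`) and the particular distance used are hypothetical choices made for illustration, not the definitions from the paper, which are given in the full text.

```python
import numpy as np

def forward(weights, x):
    """Bias-free ReLU network; positively homogeneous in each layer's weights."""
    h = x
    for W in weights[:-1]:
        h = np.maximum(W @ h, 0.0)
    return weights[-1] @ h

def normalize(weights):
    """Scale every layer to unit Frobenius norm (an illustrative assumption)."""
    return [W / np.linalg.norm(W) for W in weights]

def layerwise_angle_distance(ws_a, ws_b):
    """Sum over layers of the distance between unit-normalized weights.

    Hypothetical stand-in for an angle-wise distance; it ignores layer scale.
    """
    pairs = zip(normalize(ws_a), normalize(ws_b))
    return sum(np.linalg.norm(a - b) for a, b in pairs)

rng = np.random.default_rng(0)
weights = [rng.standard_normal((5, 3)), rng.standard_normal((2, 5))]
x = rng.standard_normal(3)

# Rescale each layer by a different positive factor.
rescaled = [3.0 * weights[0], 0.5 * weights[1]]

out = forward(weights, x)
out_rescaled = forward(rescaled, x)          # scaled by 3.0 * 0.5 = 1.5
norm_out = forward(normalize(weights), x)
norm_out_rescaled = forward(normalize(rescaled), x)
dist = layerwise_angle_distance(weights, rescaled)
```

In this sketch the raw output changes by the product of the layer scales, while the output computed from normalized weights and the layer-wise distance are unaffected by the rescaling. A Euclidean distance between the raw weight vectors would instead report the two parameter settings as far apart.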

Journal: Neural Networks: The Official Journal of the International Neural Network Society
Publication Date: Apr 25, 2023
Citations: 4

Similar Papers

Stochastic Natural Gradient Descent by estimation of empirical covariances
Luigi Malago ... Matteucci Matteo
01 Jun 2011

Analysis of stochastic gradient descent in continuous time
Jonas Latz
Statistics and Computing | VOL. 31
09 May 2021

Gradient Descent for Non-convex Problems in Modern Machine Learning
27 Jun 2019

To regularize or not: Revisiting SGD with simple algorithms and experimental studies
Wenwu He ... Yang Liu
Expert Systems with Applications | VOL. 112
15 Jun 2018
