Applying statistical learning theory to deep learning

Cédric Gerbelot, Avetik Karagulyan, Stefani Karp, Kavya Ravichandran, Menachem Stern, Nathan Srebro

doi:10.1088/1742-5468/ad3a5f

Open Access

https://doi.org/10.1088/1742-5468/ad3a5f

Abstract

Although statistical learning theory provides a robust framework to understand supervised learning, many theoretical aspects of deep learning remain unclear; in particular, how different architectures may lead to inductive bias when trained using gradient-based methods. The goal of these lectures is to provide an overview of some of the main questions that arise when attempting to understand deep learning from a learning theory perspective. After a brief reminder on statistical learning theory and stochastic optimization, we discuss implicit bias in the context of benign overfitting. We then move to a general description of the mirror descent algorithm, showing how we may go back and forth between a parameter space and the corresponding function space for a given learning problem, as well as how the geometry of the learning problem may be represented by a metric tensor. Building on this framework, we provide a detailed study of the implicit bias of gradient descent on linear diagonal networks for various regression tasks, showing how the loss function, scale of parameters at initialization and depth of the network may lead to various forms of implicit bias; in particular, transitioning between kernel and feature learning regimes.

Journal: Journal of Statistical Mechanics: Theory and Experiment
Publication Date: Oct 30, 2024
License type: iop-standard

Similar Papers

Statistical Foundations of Data Science
Jianqing Fan ... Runze Li
20 Sep 2020

PENERAPAN PEMBELAJARAN BERDIFERENSIASI DALAM PERSPEKTIF TEORI BELAJAR HUMANISTIK [The Application of Differentiated Learning from the Perspective of Humanistic Learning Theory]
Dian Aprelia Rukmi ... Titik Mutiah
Jurnal Pendidikan Dasar Flobamorata | VOL. 4
24 Nov 2023

A Learning Theory Approach to System Identification
M Vidyasagar ... Rajeeva L Karandikar
IFAC Proceedings Volumes | VOL. 37
01 Jan 2004

Testability and Ockham’s Razor: How Formal and Statistical Learning Theory Converge in the New Riddle of Induction
Daniel Steel
Journal of Philosophical Logic | VOL. 38
07 Aug 2009
