Abstract

In this paper, the worst-case data-generating probability measure is introduced as a tool for characterizing the generalization capabilities of machine learning algorithms. This worst-case measure is a Gibbs probability measure and the unique solution to the maximization of the expected loss under a relative entropy constraint with respect to a reference probability measure. Fundamental generalization metrics, such as the sensitivity of the expected loss, the sensitivity of the empirical risk, and the generalization gap, are shown to admit closed-form expressions involving the worst-case data-generating probability measure. Existing results for the Gibbs algorithm, such as the characterization of the generalization gap as the sum of the mutual information and the lautum information, up to a constant factor, are recovered. A novel parallel is established between the worst-case data-generating probability measure and the Gibbs algorithm. Specifically, the Gibbs probability measure is identified as a fundamental commonality between the model space and the data space of machine learning algorithms.
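For concreteness, the constrained maximization described in the abstract presumably takes the following standard variational form; the notation below ($Q$ for the reference measure, $\ell$ for the loss, $\theta$ for the model, $\beta$ and $c$ for the temperature and entropy budget) is illustrative and not taken from the paper.

```latex
% Sketch (illustrative notation, not from the abstract): the worst-case
% data-generating measure P* maximizes the expected loss subject to a
% relative entropy budget c with respect to a reference measure Q.
\[
  P^{\star} \in \arg\max_{P \,:\, D(P \,\|\, Q) \le c} \;
    \mathbb{E}_{Z \sim P}\!\left[ \ell(\theta, Z) \right],
\]
% whose solution, if the abstract's claim holds, is the Gibbs
% (exponentially tilted) measure
\[
  \frac{\mathrm{d}P^{\star}}{\mathrm{d}Q}(z)
    = \frac{\exp\!\left( \beta \, \ell(\theta, z) \right)}
           {\mathbb{E}_{Z \sim Q}\!\left[ \exp\!\left( \beta \, \ell(\theta, Z) \right) \right]},
\]
% for some parameter \beta > 0 determined by the entropy budget c.
```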
