Measurements of Generalisation Based on Information Geometry

Huaiyu Zhu,Richard Rohwer

doi:10.1007/978-1-4615-6099-9_69

Measurements of Generalisation Based on Information Geometry

Huaiyu Zhu, Richard Rohwer

Open Access

https://doi.org/10.1007/978-1-4615-6099-9_69

Copy DOI

Publication Date: Jan 1, 1997

Citations: 18

Affiliation: Aston University, Santa Fe Institute, Prediction Systems (United States)

#Information Geometry #Bayesian Posterior + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

Neural networks are statistical models and learning rules are estimators. In this paper a theory for measuring generalisation is developed by combining Bayesian decision theory with information geometry. The performance of an estimator is measured by the information divergence between the true distribution and the estimate, averaged over the Bayesian posterior. This unifies the majority of error measures currently in use. The optimal estimators also reveal some intricate interrelationships among information geometry, Banach spaces and sufficient statistics.

Full Text