Sparse methods for automatic relevance determination

Samuel H Rudy,Themistoklis P Sapsis

doi:10.1016/j.physd.2021.132843

Samuel H Rudy, Themistoklis P Sapsis

Open Access

https://doi.org/10.1016/j.physd.2021.132843

Copy DOI

Journal: Physica D. Nonlinear phenomena	Publication Date: Jan 11, 2021
Citations: 51	License type: publisher-specific-oa

Affiliation: Massachusetts Institute of Technology

Abstract

This work considers methods for imposing sparsity in Bayesian regression with applications in nonlinear system identification. We first review automatic relevance determination (ARD) and analytically demonstrate the need to additional regularization or thresholding to achieve sparse models. We then discuss two classes of methods, regularization based and thresholding based, which build on ARD to learn parsimonious solutions to linear problems. In the case of orthogonal features, we analytically demonstrate favorable performance with regard to learning a small set of active terms in a linear system with a sparse solution. Several example problems are presented to compare the set of proposed methods in terms of advantages and limitations to ARD in bases with hundreds of elements. The aim of this paper is to analyze and understand the assumptions that lead to several algorithms and to provide theoretical and empirical results so that the reader may gain insight and make more informed choices regarding sparse Bayesian regression.

Full Text