A scalable sparse Cholesky based approach for learning high-dimensional covariance matrices in ordered data

Kshitij Khare,Sang-Yun Oh,Syed Rahman,Bala Rajaratnam

doi:10.1007/s10994-019-05810-5

Abstract

Covariance estimation for high-dimensional datasets is a fundamental problem in machine learning, and has numerous applications. In these high-dimensional settings the number of features or variables p is typically larger than the sample size n. A popular way of tackling this challenge is to induce sparsity in the covariance matrix, its inverse or a relevant transformation. In many applications, the data come with a natural ordering. In such settings, methods inducing sparsity in the Cholesky parameter of the inverse covariance matrix can be quite useful. Such methods are also better positioned to yield a positive definite estimate of the covariance matrix, a critical requirement for several downstream applications. Despite some important advances in this area, a principled approach to general sparse-Cholesky based covariance estimation with both statistical and algorithmic convergence safeguards has been elusive. In particular, the two popular likelihood based methods proposed in the literature either do not lead to a well-defined estimator in high-dimensional settings, or consider only a restrictive class of models. In this paper, we propose a principled and general method for sparse-Cholesky based covariance estimation that aims to overcome some of the shortcomings of current methods, but retains their respective strengths. We obtain a jointly convex formulation for our objective function, and show that it leads to rigorous convergence guarantees and well-defined estimators, even when $$p > n$$ . Very importantly, the approach always leads to a positive definite and symmetric estimator of the covariance matrix. We establish both high-dimensional estimation and selection consistency, and also demonstrate excellent finite sample performance on simulated/real data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A scalable sparse Cholesky based approach for learning high-dimensional covariance matrices in ordered data

Abstract

Talk to us

Similar Papers

More From: Machine Learning

Lead the way for us

Journal: Machine Learning	Publication Date: Jun 4, 2019
Citations: 15

Similar Papers

Advances in high-dimensional covariance matrix estimation

-

01 Jan 2015
01 Jan 2015

Approaches to High‐Dimensional Covariance and Precision Matrix Estimations
Jianqing Fan ... Han Liu
-
Jianqing Fan, et. al.Jianqing Fan ... Han Liu
26 Apr 2016
26 Apr 2016

Covariance regularization by thresholding
Peter J Bickel ... Elizaveta Levina
The Annals of Statistics | VOL. 36
Peter J Bickel, et. al.Peter J Bickel ... Elizaveta Levina
01 Dec 2008
The Annals of Statistics | VOL. 36

Covariance Estimation in High Dimensions Via Kronecker Product Expansions
Theodoros Tsiligkaridis ... Alfred O Hero
IEEE Transactions on Signal Processing | VOL. 61
Theodoros Tsiligkaridis, et. al.Theodoros Tsiligkaridis ... Alfred O Hero
01 Nov 2013
IEEE Transactions on Signal Processing | VOL. 61

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A scalable sparse Cholesky based approach for learning high-dimensional covariance matrices in ordered data

Abstract

Talk to us

Similar Papers

More From: Machine Learning