Canonical correlation analysis (CCA) is a popular statistical tool in multivariate analysis, and a regularized version is often used to stabilize the estimate. Motivated by recent interest in sketching estimators for linear regression, which address the computational burden of massive data sets, we investigate sketched estimation for CCA, which includes random subsampling as a special case. Theoretical results are established using perturbation theory. The method is also illustrated through Monte Carlo studies and a real data analysis.
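To make the idea concrete, the following is a minimal sketch of regularized CCA computed on the full data and on a uniform random subsample, the special case of sketching mentioned above. It is an illustration under stated assumptions, not the estimator as defined in the paper: the ridge-type penalty `lam`, the helper names (`inv_sqrt`, `regularized_cca`, `subsample_rows`), the simulated data, and the choice to center the columns before sketching are all hypothetical.

```python
import numpy as np

def inv_sqrt(a):
    # Inverse square root of a symmetric positive definite matrix.
    w, v = np.linalg.eigh(a)
    return (v / np.sqrt(w)) @ v.T

def regularized_cca(x, y, lam=1e-3):
    # Canonical correlations from ridge-regularized covariance estimates
    # (assumed form: add lam * I to each within-set covariance).
    n = x.shape[0]
    sxx = x.T @ x / n + lam * np.eye(x.shape[1])
    syy = y.T @ y / n + lam * np.eye(y.shape[1])
    sxy = x.T @ y / n
    m = inv_sqrt(sxx) @ sxy @ inv_sqrt(syy)
    return np.linalg.svd(m, compute_uv=False)

def subsample_rows(x, y, m, rng):
    # Uniform subsampling without replacement: keep the same m rows of x and y.
    idx = rng.choice(x.shape[0], size=m, replace=False)
    return x[idx], y[idx]

# Simulated data (hypothetical): two views sharing a 2-dimensional latent signal.
rng = np.random.default_rng(0)
n = 20000
z = rng.standard_normal((n, 2))
x = z @ rng.standard_normal((2, 3)) + rng.standard_normal((n, 3))
y = z @ rng.standard_normal((2, 2)) + rng.standard_normal((n, 2))

# Center columns once on the full data (an assumption of this sketch).
x = x - x.mean(axis=0)
y = y - y.mean(axis=0)

rho_full = regularized_cca(x, y)
xs, ys = subsample_rows(x, y, m=2000, rng=rng)
rho_sketch = regularized_cca(xs, ys)

print("full data:", rho_full)
print("subsample:", rho_sketch)
```

Here the subsample uses one tenth of the rows, so the covariance products cost roughly one tenth as much to form; the gap between `rho_full` and `rho_sketch` is the kind of approximation error that a perturbation analysis would aim to bound.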