Supervised Dirichlet Process Mixtures of Principal Component Analysis

Jiangtao Ren, Kang Li, Chaotao Chen

doi:10.1016/j.neucom.2018.04.047

Abstract

We introduce probabilistic principal component analysis (PPCA) into Dirichlet Process Mixtures of Generalized Linear Models (DPGLM) and propose a new model called Supervised Dirichlet Process Mixtures of Principal Component Analysis (SDPM-PCA). In SDPM-PCA, we assume that the covariates and the response variable are generated separately through the latent variable of PPCA and are modeled nonparametrically using a Dirichlet process mixture. By jointly learning the latent variable, the cluster label, and the response variable, SDPM-PCA performs local dimensionality reduction within each mixture component and learns a supervised model based on the latent variable. In this way, SDPM-PCA improves both dimensionality reduction and prediction on high-dimensional data while retaining all the advantages of DPGLM. We also develop an inference algorithm for SDPM-PCA based on variational inference, which offers faster training and a deterministic approximation compared with MCMC-based sampling algorithms. Finally, we instantiate SDPM-PCA for regression with a Bayesian linear regression model. We test it on several real-world datasets and compare its prediction performance with DPGLM and other standard regression models. Experimental results show that, with a properly chosen latent dimension, SDPM-PCA provides better prediction performance on high-dimensional regression problems and avoids the curse of dimensionality that affects DPGLM.
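The generative story in the abstract can be illustrated with a small simulation. The sketch below is not the authors' implementation: it assumes a truncated stick-breaking construction for the Dirichlet process, standard normal latent variables, isotropic Gaussian noise, and a linear regression on the latent variable within each component. The function names (`stick_breaking`, `sample_sdpm_pca`), the priors, and all default parameter values are hypothetical choices made only for illustration.

```python
import numpy as np

def stick_breaking(alpha, truncation, rng):
    """Truncated stick-breaking weights for a Dirichlet process."""
    betas = rng.beta(1.0, alpha, size=truncation)
    betas[-1] = 1.0  # close the stick so the weights sum to one
    # remaining[k] is the stick length left before the k-th break
    remaining = np.concatenate(([1.0], np.cumprod(1.0 - betas[:-1])))
    return betas * remaining

def sample_sdpm_pca(n, obs_dim, latent_dim, alpha=1.0, truncation=10,
                    noise_x=0.1, noise_y=0.1, seed=0):
    """Draw (x, y) pairs from an assumed mixture of PPCA-plus-regression components."""
    rng = np.random.default_rng(seed)
    weights = stick_breaking(alpha, truncation, rng)

    # Per-component parameters; these priors are assumptions, not from the paper.
    W = rng.normal(size=(truncation, obs_dim, latent_dim))   # PPCA loading matrices
    mu = rng.normal(scale=3.0, size=(truncation, obs_dim))   # PPCA means
    w_reg = rng.normal(size=(truncation, latent_dim))        # regression weights
    b_reg = rng.normal(size=truncation)                      # regression intercepts

    z = rng.choice(truncation, size=n, p=weights)            # cluster labels
    t = rng.normal(size=(n, latent_dim))                     # low-dimensional latent variables

    # Covariates and response are generated separately from the same latent t.
    x = np.einsum("ndq,nq->nd", W[z], t) + mu[z] + noise_x * rng.normal(size=(n, obs_dim))
    y = np.einsum("nq,nq->n", w_reg[z], t) + b_reg[z] + noise_y * rng.normal(size=n)
    return x, y, z, t, weights

x, y, z, t, weights = sample_sdpm_pca(n=200, obs_dim=50, latent_dim=3)
```

Here the 50-dimensional covariates and the scalar response both depend on a 3-dimensional latent variable within each cluster, which is the sense in which a supervised model learned on the latent variable can sidestep regression in the full covariate space. Inference (the variational algorithm mentioned in the abstract) is not shown.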

Journal: Neurocomputing
Publication Date: May 4, 2018
Citations: 2

Similar Papers

Probabilistic principal component analysis for metabolomic data.
Gift Nyamundanda ... Isobel Claire Gormley
BMC Bioinformatics | VOL. 11
23 Nov 2010

Performance Analysis of Dimensionality Reduction Techniques in the Context of Clustering
T Sudha ... P Nagendra Kumar
Asian Journal of Computer Science and Technology | VOL. 8
05 Jun 2019

Automated hierarchical mixtures of probabilistic principal component analyzers
Ting Su ... Jennifer G Dy
01 Jan 2004

Application of Bayesian Inference Model Variational Bayesian Principal Component Analysis (VBPCA) for Handling Missing Data in Principal Component Analysis
30 Jun 2016
