SuSiE PCA: A scalable Bayesian variable selection technique for principal component analysis

Dong Yuan,Nicholas Mancuso

doi:10.1016/j.isci.2023.108181

Dong Yuan, Nicholas Mancuso

Open Access

https://doi.org/10.1016/j.isci.2023.108181

Copy DOI

Journal: iScience	Publication Date: Oct 13, 2023
Citations: 1	License type: cc-by-nc-nd

Affiliation: University of Southern California

Abstract

Latent factor models, like principal component analysis (PCA), provide a statistical framework to infer low-rank representation in various biological contexts. However, feature selection is challenging when this low-rank structure manifests from a sparse subspace. We introduce SuSiE PCA, a scalable sparse latent factor approach that evaluates uncertainty in contributing variables through posterior inclusion probabilities. We validate our model in extensive simulations and demonstrate that SuSiE PCA outperforms other approaches in signal detection and model robustness. We apply SuSiE PCA to multi-tissue expression quantitative trait loci (eQTLs) data from GTEx v8 and identify tissue-specific factors and their contributing eGenes. We further investigate its performance on the large-scale perturbation data and find that SuSiE PCA identifies modules with a higher enrichment of ribosome-related genes than sparse PCA (false discovery rate [FDR] vs. ), while being 18x faster. Overall, SuSiE PCA provides an efficient tool to identify relevant features in high-dimensional biological data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

SuSiE PCA: A scalable Bayesian variable selection technique for principal component analysis

Abstract

Talk to us

Similar Papers

More From: iScience

Lead the way for us

Similar Papers

Assessment of input variables determination on the SVM model performance using PCA, Gamma test, and forward selection techniques for monthly stream flow prediction
R Noori ... M Ghafari Gousheh
Journal of Hydrology | VOL. 401
R Noori, et. al.R Noori ... M Ghafari Gousheh
22 Feb 2011
Journal of Hydrology | VOL. 401

Assessment of water quality variations under non-rainy and rainy conditions by principal component analysis techniques in Lake Doam watershed, Korea
Bal Dev Bhattrai ... Sungjin Kwak
Journal of Ecology and Environment | VOL. 38
Bal Dev Bhattrai, et. al.Bal Dev Bhattrai ... Sungjin Kwak
28 May 2015
Journal of Ecology and Environment | VOL. 38

Detecting Anomalous Network Traffic in IoT Networks
Dang Hai Hoang ... Ha Duong Nguyen
-
Dang Hai Hoang, et. al.Dang Hai Hoang ... Ha Duong Nguyen
01 Feb 2019
01 Feb 2019

Accuracy and Data Compression Trade-Offs for Power Quality Disturbance Representation with DWT and PCA techniques
L.B Soares ... S Bampi
Renewable Energy and Power Quality Journal | VOL. -
L.B Soares, et. al.L.B Soares ... S Bampi
01 Mar 2013
Renewable Energy and Power Quality Journal | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SuSiE PCA: A scalable Bayesian variable selection technique for principal component analysis

Abstract

Talk to us

Similar Papers

More From: iScience