Combining least absolute shrinkage and selection operator (LASSO) and principal-components analysis for detection of gene-gene interactions in genome-wide association studies

Gina M D'Angelo,C Charles Gu,Dc Rao

doi:10.1186/1753-6561-3-s7-s62

Abstract

Variable selection in genome-wide association studies can be a daunting task and statistically challenging because there are more variables than subjects. We propose an approach that uses principal-component analysis (PCA) and least absolute shrinkage and selection operator (LASSO) to identify gene-gene interaction in genome-wide association studies. A PCA was used to first reduce the dimension of the single-nucleotide polymorphisms (SNPs) within each gene. The interaction of the gene PCA scores were placed into LASSO to determine whether any gene-gene signals exist. We have extended the PCA-LASSO approach using the bootstrap to estimate the standard errors and confidence intervals of the LASSO coefficient estimates. This method was compared to placing the raw SNP values into the LASSO and the logistic model with individual gene-gene interaction. We demonstrated these methods with the Genetic Analysis Workshop 16 rheumatoid arthritis genome-wide association study data and our results identified a few gene-gene signals. Based on our results, the PCA-LASSO method shows promise in identifying gene-gene interactions, and, at this time we suggest using it with other conventional approaches, such as generalized linear models, to narrow down genetic signals.

Highlights

The goal of this paper is to develop and evaluate prediction methods and tools for genome-wide association studies, for variable selection and dimension reduction
We have extended the least absolute shrinkage and selection operator (LASSO) method to estimate standard errors and confidence intervals with the bootstrap
Enough, whether the principal-component score or the raw single-nucleotide polymorphisms (SNPs) values were placed into the LASSO, the final results were the same

Summary

Introduction

The goal of this paper is to develop and evaluate prediction methods and tools for genome-wide association studies, for variable selection and dimension reduction. Technical advances have enabled the collection of massive high-dimensional datasets in such studies. This has called for broadening of the area of research in dimension-reduction techniques to provide methods for prediction and variable selection. During the last decade, Li [1], Tibshirani [2], and Efron et al [3] have paved new directions for dimension-reduction techniques and broadened the area to other applications of prediction, including genetics. We explore extensions of currently existing dimension-reduction methods and variable-

Objectives

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Proceedings	Publication Date: Dec 1, 2009
Citations: 53	License type: CC BY 2.0

R Discovery Prime

R Discovery Prime

Combining least absolute shrinkage and selection operator (LASSO) and principal-components analysis for detection of gene-gene interactions in genome-wide association studies

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Proceedings

Lead the way for us

Similar Papers

Comparison of three statistical approaches for feature selection for fine-scale genetic population assignment in four pig breeds.
Ichrak Hayah ... Sara Botti
Tropical animal health and production | VOL. 53
Ichrak Hayah, et. al.Ichrak Hayah ... Sara Botti
01 Jul 2021
Tropical animal health and production | VOL. 53

Machine Learning-Based Survival Analysis Reveals Prognostic Clinical and Genetic Insights for Patients with Cutaneous T-Cell Lymphoma
Celine M Schreidah ... Fernando Gallardo
Blood | VOL. 142
Celine M Schreidah, et. al.Celine M Schreidah ... Fernando Gallardo
28 Nov 2023
Blood | VOL. 142

Decision letter: Common genetic variations in telomere length genes and lung cancer: a Mendelian randomisation study and its novel application in lung tumour transcriptome
Ben Voight ... Eduardo L Franco
-
Ben Voight, et. al.Ben Voight ... Eduardo L Franco
08 Dec 2022
08 Dec 2022

Editor's evaluation: Common genetic variations in telomere length genes and lung cancer: a Mendelian randomisation study and its novel application in lung tumour transcriptome
Nicholas E Banovich
-
Nicholas E BanovichNicholas E Banovich
08 Dec 2022
08 Dec 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Combining least absolute shrinkage and selection operator (LASSO) and principal-components analysis for detection of gene-gene interactions in genome-wide association studies

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Proceedings