Feature Selection for Detecting Gene-Gene Interactions in Genome-Wide Association Studies

Faramarz Dorani,Ting Hu

doi:10.1007/978-3-319-77538-8_3

Abstract

Disease association studies aim at finding the genetic variations underlying complex human diseases in order to better understand the etiology of the disease and to provide better diagnoses, treatment, and even prevention. The non-linear interactions among multiple genetic factors play an important role in finding those genetic variations, but have not always been taken fully into account. This is due to the fact that searching combinations of interacting genetic factors becomes inhibitive as its complexity grows exponentially with the size of data. It is especially challenging for genome-wide association studies (GWAS) where typically more than a million single-nucleotide polymorphisms (SNPs) are under consideration. Dimensionality reduction is thus needed to allow us to investigate only a subset of genetic attributes that most likely have interaction effects. In this article, we conduct a comprehensive study by examining six widely used feature selection methods in machine learning for filtering interacting SNPs rather than the ones with strong individual main effects. Those six feature selection methods include chi-square, logistic regression, odds ratio, and three Relief-based algorithms. By applying all six feature selection methods to both a simulated and a real GWAS datasets, we report that Relief-based methods perform the best in filtering SNPs associated with a disease in terms of strong interaction effects.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Feature Selection for Detecting Gene-Gene Interactions in Genome-Wide Association Studies

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Cloud computing for detecting high-order genome-wide epistatic interaction via dynamic clustering
Xuan Guo ... Yi Pan
BMC Bioinformatics | VOL. 15
Xuan Guo, et. al.Xuan Guo ... Yi Pan
10 Apr 2014
BMC Bioinformatics | VOL. 15

From genotypes to genometypes: putting the genome back in genome-wide association studies
J H Moore
European Journal of Human Genetics | VOL. 17
J H MooreJ H Moore
11 Mar 2009
European Journal of Human Genetics | VOL. 17

Exploration of the Genetic Basis of GVHD by Genetic Association Studies
Seishi Ogawa ... Sasazuki Takehiko
Biology of Blood and Marrow Transplantation | VOL. 15
Seishi Ogawa, et. al.Seishi Ogawa ... Sasazuki Takehiko
01 Jan 2009
Biology of Blood and Marrow Transplantation | VOL. 15

A novel method to identify high order gene-gene interactions in genome-wide association studies: Gene-based MDR
Sohee Oh ... Taesung Park
BMC Bioinformatics | VOL. 13
Sohee Oh, et. al.Sohee Oh ... Taesung Park
01 Jun 2012
BMC Bioinformatics | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Feature Selection for Detecting Gene-Gene Interactions in Genome-Wide Association Studies

Abstract

Talk to us

Similar Papers