Neural networks for genetic epidemiology: past, present, and future

Alison A Motsinger-Reif, Marylyn D Ritchie

doi:10.1186/1756-0381-1-3

Abstract

During the past two decades, the field of human genetics has experienced an information explosion. The completion of the human genome project and the development of high-throughput SNP technologies have created a wealth of data; however, the analysis and interpretation of these data have created a research bottleneck. While technology facilitates the measurement of hundreds or thousands of genes, statistical and computational methodologies are lacking for the analysis of these data. New statistical methods and variable selection strategies must be explored for identifying disease susceptibility genes for common, complex diseases. Neural networks (NN) are a class of pattern recognition methods that have been successfully implemented for data mining and prediction in a variety of fields. The application of NN for statistical genetics studies is an active area of research. Neural networks have been applied in both linkage and association analysis for the identification of disease susceptibility genes.

In the current review, we consider how NN have been used for both linkage and association analyses in genetic epidemiology. We discuss both the successes of these initial NN applications and the questions that arose during the previous studies. Finally, we introduce evolutionary computing strategies, Genetic Programming Neural Networks (GPNN) and Grammatical Evolution Neural Networks (GENN), for using NN in association studies of complex human diseases that address some of the caveats illuminated by previous work.

Highlights

The identification of disease susceptibility genes for complex, multifactorial disease is arguably the most difficult challenge facing human geneticists today [1].
We have reviewed traditional back-propagation neural networks (NN) and their previous applications in genetic epidemiology for linkage and association studies.
We have limited our discussion to back-propagation NN because they are the type of NN most commonly used in genetic epidemiology.
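
To make the term concrete, the following is a minimal sketch of a back-propagation NN with a single hidden layer, written in plain Python. It is an illustration only and is not taken from the paper: the toy data set (two SNPs coded 0/1/2, with affection status set to 1 when both loci carry at least one minor allele), the function names, the network size, and the learning rate are all assumptions chosen for brevity.

```python
import math
import random

def sigmoid(z):
    z = max(-60.0, min(60.0, z))  # clamp so math.exp cannot overflow
    return 1.0 / (1.0 + math.exp(-z))

def init_network(n_inputs, n_hidden, rng):
    """Random starting weights; the last weight in each row is a bias."""
    hidden = [[rng.uniform(-1, 1) for _ in range(n_inputs + 1)] for _ in range(n_hidden)]
    output = [rng.uniform(-1, 1) for _ in range(n_hidden + 1)]
    return hidden, output

def forward(net, x):
    """Return the inputs and hidden activations (each with a bias unit) and the output."""
    hidden, output = net
    xb = list(x) + [1.0]
    h = [sigmoid(sum(w * v for w, v in zip(row, xb))) for row in hidden] + [1.0]
    y = sigmoid(sum(w * v for w, v in zip(output, h)))
    return xb, h, y

def train(net, data, epochs=2000, lr=0.5):
    """Online gradient descent on squared error, with gradients from back-propagation."""
    hidden, output = net
    for _ in range(epochs):
        for x, target in data:
            xb, h, y = forward(net, x)
            d_out = (y - target) * y * (1.0 - y)  # error signal at the output unit
            # Propagate the error back to each hidden unit (uses pre-update output weights).
            d_hid = [d_out * output[j] * h[j] * (1.0 - h[j]) for j in range(len(hidden))]
            for j in range(len(output)):
                output[j] -= lr * d_out * h[j]
            for j, row in enumerate(hidden):
                for i in range(len(row)):
                    row[i] -= lr * d_hid[j] * xb[i]

def mse(net, data):
    return sum((forward(net, x)[2] - t) ** 2 for x, t in data) / len(data)

# Hypothetical toy data: all nine two-SNP genotype combinations, scaled to [0, 1].
data = [([a / 2.0, b / 2.0], float(a > 0 and b > 0)) for a in (0, 1, 2) for b in (0, 1, 2)]
rng = random.Random(42)
net = init_network(2, 3, rng)
mse_before = mse(net, data)
train(net, data)
mse_after = mse(net, data)
print(f"mean squared error before training: {mse_before:.3f}, after: {mse_after:.3f}")
```

This toy pattern cannot be separated by a single linear threshold on the scaled genotypes, which is why a hidden layer is included. The number of hidden units here is fixed by hand and is not tuned in any way.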

Summary

Introduction

The identification of disease susceptibility genes for complex, multifactorial disease is arguably the most difficult challenge facing human geneticists today [1].

Earlier work used a fully connected feedforward NN architecture with one input layer, one hidden layer, and one output layer representing affection status. The simulated data covered multiple data types, including SNP variables along with quantitative and qualitative environmental traits.

Many architecture-selection approaches use a prediction error fitness measure, selecting an architecture based on its generalization to new observations [46], while others use a classification error, or training error [43]. These methods attempt to get the most learning out of the network while avoiding over-fitting the data [43,45]. One potential solution to the architecture selection problem in NN is to evolve the NN architecture for each data set analyzed using an evolutionary computation approach. This allows the user to avoid common pitfalls associated with having the wrong network architecture.

As the field has approached genome-wide association scans, it has become crucial that methods detect associations in the presence of thousands of genetic variables.
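
The idea of evolving an architecture with a prediction error fitness measure can be sketched as follows. This is a deliberately simplified stand-in and is not GPNN or GENN: it uses a plain mutation-and-selection loop over two choices only (which SNPs feed the network and how many hidden units it has). The simulated data, the 10% label noise, the function names, and every parameter value are assumptions made for illustration.

```python
import math
import random

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-max(-60.0, min(60.0, z))))

def train_and_score(inputs, n_hidden, train_set, holdout_set, rng, epochs=60, lr=0.5):
    """Train a one-hidden-layer NN on the chosen SNPs; return its held-out error rate."""
    hid = [[rng.uniform(-1, 1) for _ in range(len(inputs) + 1)] for _ in range(n_hidden)]
    out = [rng.uniform(-1, 1) for _ in range(n_hidden + 1)]

    def predict(geno):
        x = [geno[i] / 2.0 for i in inputs] + [1.0]  # scaled genotypes plus bias
        h = [sigmoid(sum(w * v for w, v in zip(row, x))) for row in hid] + [1.0]
        return x, h, sigmoid(sum(w * v for w, v in zip(out, h)))

    for _ in range(epochs):
        for geno, status in train_set:
            x, h, y = predict(geno)
            d_out = (y - status) * y * (1.0 - y)
            d_hid = [d_out * out[j] * h[j] * (1.0 - h[j]) for j in range(n_hidden)]
            for j in range(n_hidden + 1):
                out[j] -= lr * d_out * h[j]
            for j in range(n_hidden):
                for i in range(len(x)):
                    hid[j][i] -= lr * d_hid[j] * x[i]
    wrong = sum((predict(g)[2] >= 0.5) != bool(s) for g, s in holdout_set)
    return wrong / len(holdout_set)

def mutate(individual, n_snps, rng, max_hidden=4):
    """Either toggle one SNP in or out of the input set, or change the hidden layer size."""
    inputs, n_hidden = set(individual[0]), individual[1]
    if rng.random() < 0.5:
        inputs ^= {rng.randrange(n_snps)}
        if not inputs:  # always keep at least one input
            inputs = {rng.randrange(n_snps)}
    else:
        n_hidden = min(max_hidden, max(1, n_hidden + rng.choice([-1, 1])))
    return (tuple(sorted(inputs)), n_hidden)

def evolve(train_set, holdout_set, n_snps, rng, pop_size=6, generations=4):
    """Keep the better half of each generation and refill it with mutated copies."""
    population = [mutate(((rng.randrange(n_snps),), 2), n_snps, rng) for _ in range(pop_size)]
    best = None
    for _ in range(generations):
        scored = sorted(
            (train_and_score(ind[0], ind[1], train_set, holdout_set, rng), ind)
            for ind in population
        )
        if best is None or scored[0][0] < best[0]:
            best = scored[0]
        parents = [ind for _, ind in scored[: pop_size // 2]]
        children = [mutate(rng.choice(parents), n_snps, rng) for _ in range(pop_size - len(parents))]
        population = parents + children
    return best

def simulate(n, n_snps, rng):
    """Hypothetical model: affected when SNP 0 and SNP 1 both carry a minor allele."""
    data = []
    for _ in range(n):
        geno = [rng.choice([0, 1, 2]) for _ in range(n_snps)]
        status = int(geno[0] > 0 and geno[1] > 0)
        if rng.random() < 0.1:  # 10% label noise
            status = 1 - status
        data.append((geno, status))
    return data

rng = random.Random(0)
data = simulate(70, 4, rng)
train_set, holdout_set = data[:40], data[40:]
best_error, (best_inputs, best_hidden) = evolve(train_set, holdout_set, 4, rng)
print(f"held-out error {best_error:.2f} with SNPs {best_inputs} and {best_hidden} hidden unit(s)")
```

Because the held-out set is used to choose the architecture, the reported error is likely optimistic; a separate validation set would be needed for an unbiased estimate. Replacing the held-out set with the training set in the fitness call would turn the prediction error measure into a training error measure, which is the contrast drawn above.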



Journal: BioData Mining
Publication Date: Jul 17, 2008
Citations: 73
License type: CC BY 2.0


