Abstract

Genes for complex disorders have proven hard to find using linkage analysis. The results rarely reach the desired level of significance and researchers often have failed to replicate positive findings. There is, however, a wealth of information from other scientific approaches which enables the formation of hypotheses on groups of genes or genomic regions likely to be enriched in disease loci. Examples include genes belonging to specific pathways or producing proteins interacting with known risk factors, genes that show altered expression levels in patients or even the group of top scoring locations in a linkage study. We show here that this hypothesis of enrichment for disease loci can be tested using genome-wide linkage data, provided that these data are independent from the data used to generate the hypothesis. Our method is based on the fact that non-parametric linkage analyses are expected to show increased scores at each one of the disease loci, although this increase might not rise above the noise of stochastic variation. By using a summary statistic and calculating its empirical significance, we show that enrichment hypotheses can be tested with power higher than the power of the linkage scan data to identify individual loci. Via simulated linkage scans for a number of different models, we gain insight in the interpretation of genome scan results and test the power of our proposed method. We present an application of the method to real data from a late-onset Alzheimer's disease linkage scan as a proof of principle.

Highlights

  • In complex disorders where variations in more than one gene are expected to contribute to disease risk, researchers often hypothesise that particular groups of genes or genomic locations are enriched with true disease-susceptibility genes, based on various lines of evidence

  • We propose a method for using genome-wide linkage data and the widely used nonparametric linkage (NPL) score[1] to test for the enrichment of groups of genes or genetic locations in disease-susceptibility genes

  • The NPL scores and the fraction of true positives among the top linkage peaks decrease as the number of disease loci increases and as their relative risk decreases

Read more

Summary

Introduction

In complex disorders where variations in more than one gene are expected to contribute to disease risk, researchers often hypothesise that particular groups of genes or genomic locations are enriched with true disease-susceptibility genes, based on various lines of evidence. Groups of genes found to be differentially expressed in a case-controlled microarray experiment or the human loci syntenic to those identified by linkage in a mouse disease model are likely to be enriched in susceptibility genes. One can hypothesise disease gene enrichment based on functional data It can be suggested, for example, that the genes involved in glutamate neurotransmission are enriched with schizophreniasusceptibility genes, or — combining more than one line of evidence — that differentially expressed glutaminergic genes in particular are likely to be enriched. We propose a method for using genome-wide linkage data and the widely used nonparametric linkage (NPL) score[1] to test for the enrichment of groups of genes or genetic locations in disease-susceptibility genes

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call