Abstract
BackgroundCorrectly merged data sets that have been independently genotyped can increase statistical power in linkage and association studies. However, alleles from microsatellite data sets genotyped with different experimental protocols or platforms cannot be accurately matched using base-pair size information alone. In a previous publication we introduced a statistical model for merging microsatellite data by matching allele frequencies between data sets. These methods are implemented in our software MicroMerge version 1 (v1). While MicroMerge v1 output can be analyzed by some genetic analysis programs, many programs can not analyze alignments that do not match alleles one-to-one between data sets. A consequence of such alignments is that codominant genotypes must often be analyzed as phenotypes. In this paper we describe several extensions that are implemented in MicroMerge version 2 (v2).ResultsNotably, MicroMerge v2 includes a new one-to-one alignment option that creates merged pedigree and locus files that can be handled by most genetic analysis software. Other features in MicroMerge v2 enhance the following aspects of control: 1) optimizing the algorithm for different merging scenarios, such as data sets with very different sample sizes or multiple data sets, 2) merging small data sets when a reliable set of allele frequencies are available, and 3) improving the quantity and 4) quality of merged data. We present results from simulated and real microsatellite genotype data sets, and conclude with an association analysis of three familial dyslipidemia (FD) study samples genotyped at different laboratories. Independent analysis of each FD data set did not yield consistent results, but analysis of the merged data sets identified strong association at locus D11S2002.ConclusionThe MicroMerge v2 features will enable merging for a variety of genotype data sets, which in turn will facilitate meta-analyses for powering association analysis.
Highlights
Merged data sets that have been independently genotyped can increase statistical power in linkage and association studies
If the genotyping for a family-based linkage study is distributed by assigning complete families to each laboratory, logarithm of the odds scores can be computed for each data set separately and added to achieve a combined lod score for the linkage study
We considered a set of ten markers D11S1984, D11S2362, D11S1999, D11S1981, D11S1392, D11S2002, D11S2000, D11S1998, D11S4464 and D11S912 that were genotyped on all three data sets
Summary
Merged data sets that have been independently genotyped can increase statistical power in linkage and association studies. Alleles from microsatellite data sets genotyped with different experimental protocols or platforms cannot be accurately matched using base-pair size information alone. In a previous publication we introduced a statistical model for merging microsatellite data by matching allele frequencies between data sets. These methods are implemented in our software MicroMerge version 1 (v1). As increasing the sample size for a genetic analysis improves the statistical power and the ability to detect a genetic effect, complex disease studies are increasingly employing collaboration. An association study is best powered by combining data sets prior to analysis. If combining data sets is attempted manually without the aid of a merging algorithm, it is cumbersome at best and subject to error
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.