Abstract
For population case-control association studies, the false-positive rates can be high due to inappropriate controls, which can occur if there is population admixture or stratification. Moreover, it is not always clear how to choose appropriate controls. Alternatively, the parents or normal sibs can be used as controls of affected sibs. For late-onset complex diseases, parental data are not usually available. One way to study late-onset disorders is to perform sib-pair or sibship analyses. This paper proposes sibship-based Hotelling's T2 test statistics for high-resolution linkage disequilibrium mapping of complex diseases. For a sample of sibships, suppose that each sibship consists of at least one affected sib and at least one normal sib. Assume that genotype data of multiple tightly linked markers/haplotypes are available for each individual in the sample. Paired Hotelling's T2 test statistics are proposed for high-resolution association studies using normal sibs as controls for affected sibs, based on two coding methods: 'haplotype/allele coding' and 'genotype coding'. The paired Hotelling's T2 tests take into account not only the correlation among the markers, but also take the correlation within each sib-pair. The validity of the proposed method is justified by rigorous mathematical and statistical proofs under the large sample theory. The non-centrality parameter approximations of the test statistics are calculated for power and sample size calculations. By carrying out power and simulation studies, it was found that the non-centrality parameter approximations of the test statistics were accurate. By power and type I error analysis, the test statistics based on the 'haplotype/allele coding' method were found to be advantageous in comparison to the test statistics based on the 'genotype coding' method. The test statistics based on multiple markers can have higher power than those based on a single marker. The test statistics can be applied not only for bi-allelic markers, but also for multi-allelic markers. In addition, the test statistics can be applied to analyse the genetic data of multiple markers which contain double heterozygotes -- that is, unknown linkage phase data. An SAS macro, Hotel_sibs.sas, is written to implement the method for data analysis.
Highlights
In recent years, there has been great interest in the research of association studies of complex diseases.[1,2,3,4,5,6] By association studies, we mean linkage disequilibrium (LD) mapping of genetic traits
This paper proposes sibshipbased paired Hotelling’s T 2 test statistics for high-resolution LD mapping of complex diseases
Based an two coding methods—‘haplotype/allele coding’ and ‘genotype coding’—paired Hotelling’s T 2 test statistics TH and TG are proposed for high-resolution association studies, using normal sibs as controls for affected sibs
Summary
There has been great interest in the research of association studies of complex diseases.[1,2,3,4,5,6] By association studies, we mean linkage disequilibrium (LD) mapping of genetic traits. For population case-control studies, the marker allele frequency in cases can be compared with that of controls using x 2 test statistics.[7,8,9,10,11] If there is association between one marker and the trait locus, it is expected that the x 2 tests would lead to significant results. This method can be applied to analyse the data for one marker at a time. With the development of dense maps such as single nucleotide polymorphisms (SNPs), haplotype maps and high-resolution micro-satellites in the human genome, enormous amounts of genetic data on human chromosomes are becoming available.[13,14,15] It is interesting when building appropriate models and useful algorithms in association mapping of complex diseases to have the ability to use multiple markers/haplotypes simultaneously
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.