Abstract

BackgroundThe development of next-generation sequencing technologies has facilitated the identification of rare variants. Family-based design is commonly used to effectively control for population admixture and substructure, which is more prominent for rare variants. Case-parents studies, as typical strategies in family-based design, are widely used in rare variant-disease association analysis. Current methods in case-parents studies are based on complete case-parents data; however, parental genotypes may be missing in case-parents trios, and removing these data may lead to a loss in statistical power. The present study focuses on testing for rare variant-disease association in case-parents study by allowing for missing parental genotypes.ResultsIn this report, we extended the collapsing method for rare variant association analysis in case-parents studies to allow for missing parental genotypes, and investigated the performance of two methods by using the difference of genotypes between affected offspring and their corresponding “complements” in case-parent trios and TDT framework. Using simulations, we showed that, compared with the methods just only using complete case-parents data, the proposed strategy allowing for missing parental genotypes, or even adding unrelated affected individuals, can greatly improve the statistical power and meanwhile is not affected by population stratification.ConclusionsWe conclude that adding case-parents data with missing parental genotypes to complete case-parents data set can greatly improve the power of our strategy for rare variant-disease association.

Highlights

  • The development of next-generation sequencing technologies has facilitated the identification of rare variants

  • We investigate the performance of the proposed method in a homogenous population as well as in populations with population stratification under three scenarios: complete caseparents data mixed with one parental genotypes missing, complete case-parents data mixed with both parental genotypes missing, and complete case-parents data mixed with one and both parental genotypes missing

  • In order to further show the magnitude of power improvement of Z~ C and transmission/disequilibrium test (TDT) burden of rare variants (BRV), we present in parentheses in Tables 2, 3 and 4 the proportion of power improved by adding case-parents trios of ΩΙ, ΩΙΙ, and ΩΙ+ΙΙto complete case-parents data set Ω0

Read more

Summary

Introduction

The development of next-generation sequencing technologies has facilitated the identification of rare variants. Case-parents studies, as typical strategies in family-based design, are widely used in rare variant-disease association analysis. It is often difficult to recruit large enough samples for case-parents study, especially for rare disease, and the sample size is generally small. Discarding those families with missing one or both parental genotypes can lead to statistical power loss. Statistical methods in case-parents study allowing for missing parental genotypes have been widely developed for common variant-disease association analysis [14, 15]. It is useful to develop statistical approaches in case-parents study by allowing for missing parental genotypes to test rare variant-disease association

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call