Abstract

BackgroundPopulation-based association studies are used to identify common susceptibility variants for complex genetic traits. These studies are susceptible to confounding from unknown population substructure. Here we apply a model-based clustering approach to our case-control study of stroke among young women to examine if self-reported ethnicity can serve as a proxy for genetic ancestry.FindingsA population-based case-control study of stroke among women aged 15-49 identified 361 cases of first ischemic stroke and 401 age-comparable control subjects. Thirty single nucleotide polymorphisms (SNPs) throughout the genome unrelated to stroke risk and with established ancestry-based allele frequency differences were genotyped in all participants. The Structure program was used to iteratively evaluate for K = 1 to 5 potential genetic-based subpopulations. Evaluating the population as a whole, the Structure output plateaued at K = 2 clusters. 98% of self-reported Caucasians had an estimated probability ≥50% of belonging to Cluster 1, while 94% of self-reported African-Americans had an estimated probability ≥50% of belonging to Cluster 2. Stratifying the participants by self-reported ethnicity and repeating the analyses revealed the presence of two clusters among Caucasians, suggesting that potential substructure may exist.ConclusionsAmong our combined sample of African-American and Caucasian participants there is no large unknown subpopulation and self-reported ethnicity can serve as a proxy for genetic ancestry. Ethnicity-specific analyses indicate that population substructure may exist among the Caucasian participants indicating that further studies are warranted.

Highlights

  • Population-based case-control studies are used to identify common susceptibility variants for complex genetic traits; population stratification may confound their results [1,2]

  • A panel of genetic markers specific to ancestry and unlinked to the disease can be used to evaluate whether self-reported ethnicity can serve as a proxy for genetic ancestry or relatedness [3]

  • Two subpopulations are likely because: Ethnicity specific exploratory analyses indicate some further substructure may be present among the self-reported Caucasians as log Pr (X | K) plateaus at K = 2 and α converges to a value < 0.2

Read more

Summary

Introduction

Population-based case-control studies are used to identify common susceptibility variants for complex genetic traits; population stratification may confound their results [1,2]. A panel of genetic markers specific to ancestry and unlinked to the disease can be used to evaluate whether self-reported ethnicity can serve as a proxy for genetic ancestry or relatedness [3]. We genotyped 30 markers selected because of their differing allele frequencies between European Caucasians and Nigerians (Yoruba). We used these markers to determine whether self-reported ethnicity can accurately approximate ancestry in a large biracial population of stroke cases and controls. Population-based association studies are used to identify common susceptibility variants for complex genetic traits. We apply a model-based clustering approach to our case-control study of stroke among young women to examine if self-reported ethnicity can serve as a proxy for genetic ancestry

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.