Abstract

Copy number variants (CNVs) are suggested to have a widespread impact on the human genome and phenotypes. To understand the role of CNVs across human diseases, we examine the CNV genomic landscape of 100,028 unrelated individuals of European ancestry, using SNP and CGH array datasets. We observe an average CNV burden of ~650 kb, identifying a total of 11,314 deletion, 5625 duplication, and 2746 homozygous deletion CNV regions (CNVRs). In all, 13.7% are unreported, 58.6% overlap with at least one gene, and 32.8% interrupt coding exons. These CNVRs are significantly more likely to overlap OMIM genes (2.94-fold), GWAS loci (1.52-fold), and non-coding RNAs (1.44-fold), compared with random distribution (P < 1 × 10−3). We uncover CNV associations with four major disease categories, including autoimmune, cardio-metabolic, oncologic, and neurological/psychiatric diseases, and identify several drug-repurposing opportunities. Our results demonstrate robust frequency definition for large-scale rare variant association studies, identify CNVs associated with major disease categories, and illustrate the pleiotropic impact of CNVs in human disease.

Highlights

  • Copy number variants (CNVs) are suggested to have a widespread impact on the human genome and phenotypes

  • We identified a total of 11,314 deletion, 5625 duplication, and 2746 homozygous-deletion CNV regions (CNVRs), defined as a contiguous cluster of non-singleton single-nucleotide polymorphism (SNP) that spans

  • We identified candidate genes in DA-CNVRs in a number of wellestablished disease-associated loci, including chr2p24.3 (MYCN amplification in cancer)[24], chr22q11.21 (COMT and TBX1 deletion in neuropsychiatric disease and congenital heart conditions)[25,26,27], and chr17q21.1 (NR1D1, deletion and duplication associated with response to lithium in bipolar disease and major depressive disorder)[28,29]

Read more

Summary

Introduction

Copy number variants (CNVs) are suggested to have a widespread impact on the human genome and phenotypes. We uncover CNV associations with four major disease categories, including autoimmune, cardio-metabolic, oncologic, and neurological/psychiatric diseases, and identify several drugrepurposing opportunities. Our results demonstrate robust frequency definition for large-scale rare variant association studies, identify CNVs associated with major disease categories, and illustrate the pleiotropic impact of CNVs in human disease. Division of Oncology, The Children’s Hospital of Philadelphia, Philadelphia, Pennsylvania 19104, USA. Copy number variation regions (CNVRs) are significantly more likely to overlap OMIM genes (2.94-fold), genome-wide association studies (GWAS) loci (1.52-fold), and non-coding RNAs (ncRNAs; 1.44fold), compared with random distribution (P < 1 × 10−3). We uncover strong CNV associations with four major disease categories, including autoimmune, cardio-metabolic, oncologic, and neurological/psychiatric diseases, several of which impact genes that represent potential drug targets for future validation

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call