Abstract

Copy number variants (CNVs) have been proposed as a possible source of ‘missing heritability’ in complex human diseases. Two studies of type 1 diabetes (T1D) found null associations with common copy number polymorphisms, but CNVs of low frequency and high penetrance could still play a role. We used the Log-R-ratio intensity data from a dense single nucleotide polymorphism (SNP) array, ImmunoChip, to detect rare CNV deletions (rDELs) and duplications (rDUPs) in 6808 T1D cases, 9954 controls and 2206 families with T1D-affected offspring. Initial analyses detected CNV associations. However, these were shown to be false-positive findings, failing replication with polymerase chain reaction. We developed a pipeline of quality control (QC) tests that were calibrated using systematic testing of sensitivity and specificity. The case–control odds ratios (OR) of CNV burden on T1D risk resulting from this QC pipeline converged on unity, suggesting no global frequency difference in rDELs or rDUPs. There was evidence that deletions could impact T1D risk for a small minority of cases, with enrichment for rDELs longer than 400 kb (OR = 1.57, P = 0.005). There were also 18 de novo rDELs detected in affected offspring but none for unaffected siblings (P = 0.03). No specific CNV regions showed robust evidence for association with T1D, although frequencies were lower than expected (most less than 0.1%), substantially reducing statistical power, which was examined in detail. We present an R-package, plumbCNV, which provides an automated approach for QC and detection of rare CNVs that can facilitate equivalent analyses of large-scale SNP array datasets.

Highlights

  • ResultsData quality controlExtensive testing and configuration of QC procedures were conducted on this dataset, as detailed in the ‘Materials and Methods’section and Supplementary Material, Methods S1–S11

  • In order to overcome the daunting task of Copy number variants (CNVs) detection, validation, and reduction of bias, we developed a pipeline of Log-R-ratio (LRR) quality control (QC) procedures to reduce the number of artefactual CNV

  • We investigated rare CNVs in deoxyribonucleic acid (DNA) samples from 6808 type 1 diabetes (T1D) cases, 9954 controls and 2206 families genotyped with ImmunoChip, which is one of the largest CNV studies to date using a single array platform

Read more

Summary

Results

Extensive testing and configuration of QC procedures were conducted on this dataset, as detailed in the ‘Materials and Methods’. Subsequent analyses for case–control and family datasets were performed using the set of parameters that produced the best combination of sensitivity, validation and average CNV quality score This comprised the ‘medium’ setting for both SNP and sample QC, the largest number of components for principal components (PC) correction and utilized all of the CNV-QC filters (corresponding to run 20 from Supplementary Material, Table S5). The same meta-analysis showed 0.04% incidence of 16p11.2 rDELs, similar to the observed rate of 0.03% in our control cohort For both rDELs and rDUPs, the incidence in our controls matched very closely, providing a positive control for the sensitivity of our pipeline to detect very rare CNVs. None of the transmission disequilibrium test (TDT) results for CNVRs passed the Bonferroni corrected threshold of P < 10−4. CNVRs that did not have at least one CNV with a quality score above 90%

Discussion
Findings
Materials and Methods
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call