Abstract

The ability to target and capture known exons in the human genome, and to characterize them by massively parallel sequencing, has led to the identification of the genetic causes of many Mendelian disorders. Several factors suggest that exome sequencing will remain the preferred clinical next-generation sequencing technology for some time to come. These include its low cost per unit of coverage compared with genome sequencing and the fact that interpretation of non-coding sequence is still at an early stage of development. In this study of data from the NIH Undiagnosed Diseases Program (UDP), we investigated a novel approach to quantifying the quality of exome sequencing data. We systematically evaluated the genotypable fraction across well-characterized protein-coding exons and found that >88% of exons were genotyped to completion and that, on average, >93% of all coding bases were genotyped (with a target sequencing efficiency of 96%). We also demonstrate a methodology for robust identification of consistently genotyped exons using a statistical metric, the index of dispersion (the ratio of variance to mean). This methodology allowed us to define the overall genotypability of all 167,717 autosomal exons, 95.5% of which had a reproducible pattern of sequencing. Finally, we developed a computational application that uses this reproducible and predictable pattern to reduce search complexity and confidently detect homozygous deletions of protein-coding exons. Of our 11 predicted homozygous exon-deletion events, we tested 3 in wet-lab experiments, and all 3 were confirmed. We conclude that our systematic approach to analyzing exome sequence data across our patient cohort provides a powerful computational methodology to evaluate, interpret and predict patterns that are relevant to the pathophysiology of the sequenced individuals.
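
As an illustration of the idea only, and not the authors' implementation, the sketch below computes the index of dispersion (variance divided by mean) for each exon across a cohort and flags candidate homozygous deletions. It assumes the input is a per-exon list of genotyped fractions, one value per sample; the abstract does not state which quantity the metric was computed on. The function names, the thresholds `max_dispersion` and `min_mean`, and the toy data are all hypothetical.

```python
from statistics import mean, pvariance


def index_of_dispersion(values):
    """Variance-to-mean ratio; returns None when the mean is zero."""
    m = mean(values)
    if m == 0:
        return None
    return pvariance(values) / m


def deletion_candidates(fractions_by_exon, max_dispersion=0.01, min_mean=0.9):
    """Return (exon, sample_index) pairs where one sample has no genotyped
    bases in an exon that the rest of the cohort genotypes consistently.

    The thresholds are illustrative assumptions, not values from the study.
    """
    candidates = []
    for exon, fractions in fractions_by_exon.items():
        for i, fraction in enumerate(fractions):
            if fraction != 0.0:
                continue
            # Judge reproducibility on the other samples only, so the
            # putative deletion does not inflate the dispersion.
            others = fractions[:i] + fractions[i + 1:]
            if len(others) < 2:
                continue
            d = index_of_dispersion(others)
            if d is not None and d <= max_dispersion and mean(others) >= min_mean:
                candidates.append((exon, i))
    return candidates


# Toy cohort: genotyped fraction of each exon in four samples (made-up data).
cohort = {
    "exonA": [1.0, 1.0, 0.98, 1.0],   # consistently genotyped, no dropout
    "exonB": [1.0, 0.0, 1.0, 0.99],   # dropout in sample 1 only
    "exonC": [0.2, 0.9, 0.0, 0.6],    # poorly and inconsistently genotyped
}
hits = deletion_candidates(cohort)
```

In this toy cohort only `exonB` in sample 1 is flagged. `exonC` also has a sample with zero coverage, but it is skipped because the remaining samples are neither well covered nor consistent, which is the sense in which restricting the search to reproducibly genotyped exons reduces its complexity.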
