Abstract

Genetic testing, which is now a routine part of clinical practice and disease management protocols, is often based on the assessment of small panels of variants or genes. On the other hand, continuous improvements in the speed and per-base costs of sequencing have now made whole exome sequencing (WES) and whole genome sequencing (WGS) viable strategies for targeted or complete genetic analysis, respectively. Standard WGS/WES data analytical workflows generally rely on calling of sequence variants respect to the reference genome sequence. However, the reference genome sequence contains a large number of sites represented by rare alleles, by known pathogenic alleles and by alleles strongly associated to disease by GWAS. It’s thus critical, for clinical applications of WGS and WES, to interpret whether non-variant sites are homozygous for the reference allele or if the corresponding genotype cannot be reliably called. Here we show that an alternative analytical approach based on the analysis of both variant and non-variant sites from WGS data allows to genotype more than 92% of sites corresponding to known SNPs compared to 6% genotyped by standard variant analysis. These include homozygous reference sites of clinical interest, thus leading to a broad and comprehensive characterization of variation necessary to an accurate evaluation of disease risk. Altogether, our findings indicate that characterization of both variant and non-variant clinically informative sites in the genome is necessary to allow an accurate clinical assessment of a personal genome. Finally, we propose a highly efficient extended VCF (eVCF) file format which allows to store genotype calls for sites of clinical interest while remaining compatible with current variant interpretation software.

Highlights

  • More than 24,000 tests for 5000 conditions are currently available on the NCBI Genetic Testing Registry (GTR) [http://www.ncbi.nlm.nih.gov/gtr/] and genetic testing has grown from a niche specialty for rare disorders to routine clinical practice [1,2]

  • We evaluated the genetic predisposition to heart disease in a patient with abnormal cardiac electric activity, by comparing base calling from whole genome sequencing (WGS) data with the results obtained from a commercial genetic testing kit (MI Risk plus) based on Illumina VeraCode technology

  • This assessment confirmed the power of WGS analysis but revealed that standard variant calling pipelines based on variant call format (VCF) are limited in their ability to call the patient genotype because they cannot call the genotype in loci homozygous or hemizygous for the reference allele. We addressed these limitations by applying an alternative approach based on the use of the genome VCF file and showed that it allows a much broader characterization of known variants for individuals undergoing clinical care

Read more

Summary

Introduction

More than 24,000 tests for 5000 conditions are currently available on the NCBI Genetic Testing Registry (GTR) [http://www.ncbi.nlm.nih.gov/gtr/] and genetic testing has grown from a niche specialty for rare disorders to routine clinical practice [1,2]. Genetic tests are generally based on the analysis of individual or small panels of variants and are carried out using classical techniques such as allele-specific PCR or ligation-based SNP genotyping assays as implemented, for example, in the Illumina VeraCode platform, Luminex, Sequenom and TaqMan assays. These tests are limited to well-characterized mutations and are often followed by sequencing of the complete gene(s). Bidirectional Sanger sequencing is still considered the ‘gold standard’ in clinical genetic testing [7,8] but several amplicons must be sequenced in order to cover the target region, and this approach has a low sensitivity which becomes limiting when the sample tissue is scarce, e.g. in the analysis of tumors where the percentage of cells carrying the causal mutation must be at least 15–20% for efficient detection

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call