How good are pathogenicity predictors in detecting benign variants?

Abhishek Niroula,Mauno Vihinen

doi:10.1371/journal.pcbi.1006481

Abhishek Niroula, Mauno Vihinen

Open Access

https://doi.org/10.1371/journal.pcbi.1006481

Copy DOI

Journal: PLOS Computational Biology	Publication Date: Feb 11, 2019
Citations: 86	License type: CC BY 4.0

Affiliation: Lund University

Abstract

Computational tools are widely used for interpreting variants detected in sequencing projects. The choice of these tools is critical for reliable variant impact interpretation for precision medicine and should be based on systematic performance assessment. The performance of the methods varies widely in different performance assessments, for example due to the contents and sizes of test datasets. To address this issue, we obtained 63,160 common amino acid substitutions (allele frequency ≥1% and <25%) from the Exome Aggregation Consortium (ExAC) database, which contains variants from 60,706 genomes or exomes. We evaluated the specificity, the capability to detect benign variants, for 10 variant interpretation tools. In addition to overall specificity of the tools, we tested their performance for variants in six geographical populations. PON-P2 had the best performance (95.5%) followed by FATHMM (86.4%) and VEST (83.5%). While these tools had excellent performance, the poorest method predicted more than one third of the benign variants to be disease-causing. The results allow choosing reliable methods for benign variant interpretation, for both research and clinical purposes, as well as provide a benchmark for method developers.

Highlights

Generation Sequencing (NGS) is widely used in clinical diagnosis as well as in population genetics to investigate patterns of genetic variants in healthy individuals
Variants were obtained from highquality Exome Aggregation Consortium (ExAC) database and selected to have minor allele frequency between 1 and 25%
We investigated further the performances on different populations, allele frequencies, separately for males and females, chromosome wise and for population unique and non-unique variants

Summary

Introduction

Generation Sequencing (NGS) is widely used in clinical diagnosis as well as in population genetics to investigate patterns of genetic variants in healthy individuals. There are on average about 10,000 variants per genome that cause amino acid substitutions [1]. Several databases enable annotation of disease relevance of variants and frequencies among healthy individuals. These include numerous locus specific variation databases (LSDBs) that are curated by experts in the genes and diseases. While LSDBs typically concentrate on individual genes and proteins or diseases, the general databases have much wider scope such as ClinVar [2], Online Mendelian Inheritance in Man (OMIM) [3] and the UniProt Knowledgebase (UniProtKB) [4]

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

How good are pathogenicity predictors in detecting benign variants?

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLOS Computational Biology

Lead the way for us

Similar Papers

Use of Clinical Exome Sequencing in Isolated Congenital Heart Disease.
Laura Zahavich ... Seema Mital
Circulation: Cardiovascular Genetics | VOL. 10
Laura Zahavich, et. al.Laura Zahavich ... Seema Mital
04 May 2017
Circulation: Cardiovascular Genetics | VOL. 10

Pathogenic variant burden in the ExAC database: an empirical approach to evaluating population data for clinical variant interpretation
Yuya Kobayashi ... Scott E Topper
Genome Medicine | VOL. 9
Yuya Kobayashi, et. al.Yuya Kobayashi ... Scott E Topper
06 Feb 2017
Genome Medicine | VOL. 9

Pathogenic ASXL1 somatic variants in reference databases complicate germline variant interpretation for Bohring-Opitz Syndrome.
Colleen M Carlston ... Hunter R Underhill
Human Mutation | VOL. 38
Colleen M Carlston, et. al.Colleen M Carlston ... Hunter R Underhill
21 Mar 2017
Human Mutation | VOL. 38

A novel MYH7 mutation resulting in Laing distal myopathy in a Chinese family.
Xiang-Yi Liu ... A-Ping Sun
Chinese medical journal | VOL. 132
Xiang-Yi Liu, et. al.Xiang-Yi Liu ... A-Ping Sun
05 Apr 2019
Chinese medical journal | VOL. 132

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

How good are pathogenicity predictors in detecting benign variants?

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLOS Computational Biology