OVA: integrating molecular and physical phenotype data from multiple biomedical domain ontologies with variant filtering for enhanced variant prioritization.

Agne Antanaviciute,Alexander F Markham,Christopher M Watson,Laura Crinnion,Carolina Lascelles,Ian M Carr,David T Bonthron,Sally M Harrison

doi:10.1093/bioinformatics/btv473

Agne Antanaviciute, Alexander F Markham + Show 6 more

Open Access

https://doi.org/10.1093/bioinformatics/btv473

Copy DOI

Abstract

Motivation: Exome sequencing has become a de facto standard method for Mendelian disease gene discovery in recent years, yet identifying disease-causing mutations among thousands of candidate variants remains a non-trivial task.Results: Here we describe a new variant prioritization tool, OVA (ontology variant analysis), in which user-provided phenotypic information is exploited to infer deeper biological context. OVA combines a knowledge-based approach with a variant-filtering framework. It reduces the number of candidate variants by considering genotype and predicted effect on protein sequence, and scores the remainder on biological relevance to the query phenotype.We take advantage of several ontologies in order to bridge knowledge across multiple biomedical domains and facilitate computational analysis of annotations pertaining to genes, diseases, phenotypes, tissues and pathways. In this way, OVA combines information regarding molecular and physical phenotypes and integrates both human and model organism data to effectively prioritize variants. By assessing performance on both known and novel disease mutations, we show that OVA performs biologically meaningful candidate variant prioritization and can be more accurate than another recently published candidate variant prioritization tool.Availability and implementation: OVA is freely accessible at http://dna2.leeds.ac.uk:8080/OVA/index.jspSupplementary information: Supplementary data are available at Bioinformatics online.Contact: umaan@leeds.ac.uk

Highlights

The application of next-generation sequencing for disease gene discovery or clinical diagnostics can generate large volumes of data, often resulting in identification of thousands of candidate disease genes or variants
We rank each test gene with respect to disease together with 200 randomly selected genes from the pool of all human genes which have at least minimal Gene Ontology annotations in order to avoid any bias, as known disease genes are rarely entirely unannotated
There is a notable difference in performance between the three methods that is consistent across the datasets used

Summary

Introduction

The application of next-generation sequencing for disease gene discovery or clinical diagnostics can generate large volumes of data, often resulting in identification of thousands of candidate disease genes or variants. A healthy individual genome can harbor more than a hundred genuine loss-of-function mutations (MacArthur et al, 2012), making the identification of mutations responsible for a given phenotype a non-trivial task. As systematic experimental verification of each variant is infeasible, several computational prioritization methods have emerged in recent years that attempt to tackle this problem.

Objectives

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Computer applications in the biosciences : CABIOS	Publication Date: Aug 12, 2015
Citations: 25	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

OVA: integrating molecular and physical phenotype data from multiple biomedical domain ontologies with variant filtering for enhanced variant prioritization.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Computer applications in the biosciences : CABIOS

Lead the way for us

Similar Papers

Powerful use of automated prioritization of candidate variants in genetic hearing loss with extreme etiologic heterogeneity
So Young Kim ... Changwon Keum
Scientific Reports | VOL. 11
So Young Kim, et. al.So Young Kim ... Changwon Keum
30 Sep 2021
Scientific Reports | VOL. 11

P-301 Exome sequencing reveals novel candidate variants for endometriosis and endometrial serous adenocarcinoma in a single family having multiple affected members
B.G Kina ... E Oral
Human reproduction (Oxford, England) | VOL. 37
B.G Kina, et. al.B.G Kina ... E Oral
29 Jun 2022
Human reproduction (Oxford, England) | VOL. 37

Integration of genomics and metabolomics for prioritization of rare disease variants: a 2018 literature review
Emma Graham ... Clara D M Van Karnebeek
Journal of inherited metabolic disease | VOL. 41
Emma Graham, et. al.Emma Graham ... Clara D M Van Karnebeek
01 May 2018
Integration of genomics and metabolomics for prioritization of rare disease variants: a 2018 literature review
Emma Graham ... Clara D M Van Karnebeek

Gaining Insights into Inherited Bleeding Disorders of Complex Etiology in Pediatric Patients: Whole-Exome Sequencing as First-Line Investigation Tool.
Irene Corrales ... Edurne Sarrate
Thrombosis and haemostasis | VOL. 124
Irene Corrales, et. al.Irene Corrales ... Edurne Sarrate
29 Dec 2023
Thrombosis and haemostasis | VOL. 124

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

OVA: integrating molecular and physical phenotype data from multiple biomedical domain ontologies with variant filtering for enhanced variant prioritization.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Computer applications in the biosciences : CABIOS