Abstract

BackgroundThe increasing volume and complexity of high-throughput genomic data make analysis and prioritization of variants difficult for researchers with limited bioinformatics skills. Variant Ranker allows researchers to rank identified variants and determine the most confident variants for experimental validation.ResultsWe describe Variant Ranker, a user-friendly simple web-based tool for ranking, filtering and annotation of coding and non-coding variants. Variant Ranker facilitates the identification of causal variants based on novelty, effect and annotation information. The algorithm implements and aggregates multiple prediction algorithm scores, conservation scores, allelic frequencies, clinical information and additional open-source annotations using accessible databases via ANNOVAR. The available information for a variant is transformed into user-specified weights, which are in turn encoded into the ranking algorithm. Through its different modules, users can (i) rank a list of variants (ii) perform genotype filtering for case-control samples (iii) filter large amounts of high-throughput data based on user custom filter requirements and apply different models of inheritance (iv) perform downstream functional enrichment analysis through network visualization. Using networks, users can identify clusters of genes that belong to multiple ontology categories (like pathways, gene ontology, disease categories) and therefore expedite scientific discoveries. We demonstrate the utility of Variant Ranker to identify causal genes using real and synthetic datasets. Our results indicate that Variant Ranker exhibits excellent performance by correctly identifying and ranking the candidate genesConclusionsVariant Ranker is a freely available web server on http://paschou-lab.mbg.duth.gr/Software.html. This tool will enable users to prioritise potentially causal variants and is applicable to a wide range of sequencing data.

Highlights

  • The increasing volume and complexity of high-throughput genomic data make analysis and prioritization of variants difficult for researchers with limited bioinformatics skills

  • Our results indicate that Variant Ranker exhibits excellent performance by correctly identifying and ranking the candidate genes

  • Analysis of a real exome sequencing dataset on idiopathic hemolytic anemia (MIM: 266200) We used the exome of an individual with idiopathic hemolytic anemia (IHA) for which PKLR was identified as the most likely causative gene [31, 32]. 28,644 variants were ranked reporting PKLR as the 4th rank

Read more

Summary

Introduction

The increasing volume and complexity of high-throughput genomic data make analysis and prioritization of variants difficult for researchers with limited bioinformatics skills. The end result is a ranked list of variants to take forward for functional studies or experimental validation. Using this tool, a ranked list of prioritized variants is generated by computing a single score combining existing and available information present for a variant from several databases. Variant Ranker is applicable to all types of sequencing data using the de factoVCF [14] and ANNOVAR [9]) formats The advantages of this tool are the ease of use, ability to score all variants (coding and non-coding) and flexibility in filtering offered to the user. For the purpose of downstream functional enrichment analysis to discover vital biological connections from a ranked list of variants/genes, the Network Analyser is integrated; a network visualization tool that investigates tabular results from DAVID (database for annotation, visualization and integrated discovery, https://david.ncifcrf.gov) [15, 16] through a network approach

Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call