MMR: a tool for read multi-mapper resolution.

André Kahles,Gunnar Rätsch,Jonas Behr

doi:10.1093/bioinformatics/btv624

André Kahles, Gunnar Rätsch + Show 1 more

Open Access

https://doi.org/10.1093/bioinformatics/btv624

Copy DOI

Journal: Bioinformatics	Publication Date: Oct 30, 2015
Citations: 43	License type: CC BY 4.0

Affiliation: Memorial Sloan Kettering Cancer Center

Abstract

Motivation: Mapping high-throughput sequencing data to a reference genome is an essential step for most analysis pipelines aiming at the computational analysis of genome and transcriptome sequencing data. Breaking ties between equally well mapping locations poses a severe problem not only during the alignment phase but also has significant impact on the results of downstream analyses. We present the multi-mapper resolution (MMR) tool that infers optimal mapping locations from the coverage density of other mapped reads. Results: Filtering alignments with MMR can significantly improve the performance of downstream analyses like transcript quantitation and differential testing. We illustrate that the accuracy (Spearman correlation) of transcript quantification increases by 15% when using reads of length 51. In addition, MMR decreases the alignment file sizes by more than 50%, and this leads to a reduced running time of the quantification tool. Our efficient implementation of the MMR algorithm is easily applicable as a post-processing step to existing alignment files in BAM format. Its complexity scales linearly with the number of alignments and requires no further inputs. Availability and implementation: Open source code and documentation are available for download at http://github.com/ratschlab/mmr. Comprehensive testing results and further information can be found at http://bioweb.me/mmr. Contact: andre.kahles@ratschlab.org or gunnar.ratsch@ratschlab.org Supplementary information: Supplementary data are available at Bioinformatics online.

Highlights

Addressing the increasing need for fast and accurate mapping of high throughput sequencing data to a reference sequence, many different software tools have been developed over the past years, many of which are frequently updated and improved [10, 6, 3, 7]
We present a simple, yet powerful tool, called the Multi-Mapper Resolution tool (MMR), that assigns each read to a unique mapping location in a way that the overall read coverage across the genome is as uniform as possible
We show that this strategy has a positive influence on downstream analyses, such as transcript quantification and prediction

Summary

Introduction

Addressing the increasing need for fast and accurate mapping of high throughput sequencing data to a reference sequence, many different software tools have been developed over the past years, many of which are frequently updated and improved [10, 6, 3, 7]. For the remaining, still significantly large, fraction of reads (≈10–20%, depending on alignment sensitivity), several possible mapping locations exist. We present a simple, yet powerful tool, called the Multi-Mapper Resolution tool (MMR), that assigns each read to a unique mapping location in a way that the overall read coverage across the genome is as uniform as possible. MMR makes use of the critical fraction of unambiguously aligned reads and iteratively selects the alignments of ambiguously mapping reads in a way the overall coverage becomes more uniform. We show that this strategy has a positive influence on downstream analyses, such as transcript quantification and prediction

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

MMR: a tool for read multi-mapper resolution.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Bioinformatics

Lead the way for us

Similar Papers

Abstract 2373: Deep computational analysis of human and mouse specific next-generation sequencing data generated from PDX specimen
Roopika Menon ... Johannes M Heuckmann
Cancer Research | VOL. 74
Roopika Menon, et. al.Roopika Menon ... Johannes M Heuckmann
30 Sep 2014
Cancer Research | VOL. 74

Neuropeptide precursors and neuropeptides in the sea cucumber Apostichopus japonicus: a genomic, transcriptomic and proteomic analysis
Muyan Chen ... Alzbeta Talarovicova
Scientific Reports | VOL. 9
Muyan Chen, et. al.Muyan Chen ... Alzbeta Talarovicova
20 Jun 2019
Scientific Reports | VOL. 9

Enabling large-scale next-generation sequence assembly with Blacklight
M Brian Couger ... Christopher E Mason
-
M Brian Couger, et. al.M Brian Couger ... Christopher E Mason
22 Jul 2013
22 Jul 2013

Enabling large-scale next-generation sequence assembly with Blacklight.
M Brian Couger ... Christopher E Mason
Concurrency and computation : practice & experience | VOL. 26
M Brian Couger, et. al.M Brian Couger ... Christopher E Mason
18 Mar 2014
Concurrency and computation : practice & experience | VOL. 26

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

MMR: a tool for read multi-mapper resolution.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Bioinformatics