AMAS: Optimizing the Partition and Filtration of Adaptive Seeds to Speed up Read Mapping

Ngoc Hieu Tran,Xin Chen

doi:10.1109/tcbb.2015.2465900

Abstract

Background: Identifying all possible mapping locations of next-generation sequencing (NGS) reads is highly essential in several applications such as prediction of genomic variants or protein binding motifs located in repeat regions, isoform expression quantification, metagenomics analysis, etc. However, this task is very time-consuming and majority of mapping tools only focus on one or a few best mapping locations. Results: We propose AMAS, an alignment tool specialized in identifying all possible mapping locations of NGS reads in a reference sequence. AMAS features an effective use of adaptive seeds to speed up read mapping while preserving sensitivity. Specifically, an index is designed to pre-store the locations of adaptive seeds in the reference sequence, efficiently reducing the time for seed matching and partitioning. An accurate filtration of adaptive seeds is further applied to substantially tighten the candidate alignment space. As a result, AMAS runs several times faster than other state-of-the-art read mappers while achieving similar accuracy. Conclusions: AMAS provides a valuable resource to speed up the important yet time-consuming task of identifying all mapping locations of NGS reads. AMAS is implemented in C++ based on the SeqAn library and is freely available at https://sourceforge.net/projects/ngsamas/. Keywords: next-generation sequencing, read mapping, sequence alignment, adaptive seeds, seed partition, filtration

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

AMAS: Optimizing the Partition and Filtration of Adaptive Seeds to Speed up Read Mapping

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Computational Biology and Bioinformatics

Lead the way for us

Journal: IEEE/ACM Transactions on Computational Biology and Bioinformatics	Publication Date: Feb 18, 2015
Citations: 38

Similar Papers

Accurate Prediction of RH Genotypes Using Whole Genome Sequencing Data
Yan Zheng ... Stella T Chou
Blood | VOL. 132
Yan Zheng, et. al.Yan Zheng ... Stella T Chou
29 Nov 2018
Blood | VOL. 132

AlignerBoost: A Generalized Software Toolkit for Boosting Next-Gen Sequencing Mapping Accuracy Using a Bayesian-Based Mapping Quality Framework.
Qi Zheng ... Elizabeth A Grice
PLOS Computational Biology | VOL. 12
Qi Zheng, et. al.Qi Zheng ... Elizabeth A Grice
05 Oct 2016
PLOS Computational Biology | VOL. 12

CUSHAW Suite: Parallel and Efficient Algorithms for NGS Read Alignment
Yongchao Liu ... Bertil Schmidt
-
Yongchao Liu, et. al.Yongchao Liu ... Bertil Schmidt
01 Jan 2017
01 Jan 2017

Author response: Tiled-ClickSeq for targeted sequencing of complete coronavirus genomes with simultaneous capture of RNA recombination and minority variants
Elizabeth Jaworski ...
-
Elizabeth Jaworski, et. al.Elizabeth Jaworski ...
03 Sep 2021
03 Sep 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

AMAS: Optimizing the Partition and Filtration of Adaptive Seeds to Speed up Read Mapping

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Computational Biology and Bioinformatics