MARS: improving multiple circular sequence alignment using refined sequences

Lorraine A K Ayad,Solon P Pissis

doi:10.1186/s12864-016-3477-5

Lorraine A K Ayad, Solon P Pissis

Open Access

https://doi.org/10.1186/s12864-016-3477-5

Copy DOI

Export

Save

Cite

Journal: BMC Genomics	Publication Date: Jan 14, 2017
Citations: 44	License type: open-access

Affiliation: King's College London

Abstract
Highlights/Summary
Full-Text
Similar Papers

Abstract

Listen

BackgroundA fundamental assumption of all widely-used multiple sequence alignment techniques is that the left- and right-most positions of the input sequences are relevant to the alignment. However, the position where a sequence starts or ends can be totally arbitrary due to a number of reasons: arbitrariness in the linearisation (sequencing) of a circular molecular structure; or inconsistencies introduced into sequence databases due to different linearisation standards. These scenarios are relevant, for instance, in the process of multiple sequence alignment of mitochondrial DNA, viroid, viral or other genomes, which have a circular molecular structure. A solution for these inconsistencies would be to identify a suitable rotation (cyclic shift) for each sequence; these refined sequences may in turn lead to improved multiple sequence alignments using the preferred multiple sequence alignment program.ResultsWe present MARS, a new heuristic method for improving Multiple circular sequence Alignment using Refined Sequences. MARS was implemented in the C++ programming language as a program to compute the rotations (cyclic shifts) required to best align a set of input sequences. Experimental results, using real and synthetic data, show that MARS improves the alignments, with respect to standard genetic measures and the inferred maximum-likelihood-based phylogenies, and outperforms state-of-the-art methods both in terms of accuracy and efficiency. Our results show, among others, that the average pairwise distance in the multiple sequence alignment of a dataset of widely-studied mitochondrial DNA sequences is reduced by around 5% when MARS is applied before a multiple sequence alignment is performed.ConclusionsAnalysing multiple sequences simultaneously is fundamental in biological research and multiple sequence alignment has been found to be a popular method for this task. Conventional alignment techniques cannot be used effectively when the position where sequences start is arbitrary. We present here a method, which can be used in conjunction with any multiple sequence alignment program, to address this problem effectively and efficiently.

Highlights

A fundamental assumption of all widely-used multiple sequence alignment techniques is that the leftand right-most positions of the input sequences are relevant to the alignment
We present here a method, which can be used in conjunction with any multiple sequence alignment program, to address this problem effectively and efficiently
A fundamental assumption of all widely-used multiple sequence alignment (MSA) techniques is that the left- and right-most positions of the input sequences are relevant to the alignment

Summary

Results

MARS was implemented in the C++ programming language as a program to compute the rotations (cyclic shifts) required to best align a set of input sequences. The following scores were computed for the rotated dataset in the respective order: 233, 244, and 269 These results show that when using two different MSA programs, MARS obtains a higher TCS than the unrotated dataset in both cases, outperforming BEAR and Cyclope, which do not always obtain a higher TCS compared to that of the unrotated dataset

Conclusions

Background

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

MARS: improving multiple circular sequence alignment using refined sequences

Abstract

Highlights

Summary

Published Version

Talk to us

Similar Papers

More From: BMC Genomics

Lead the way for us

Similar Papers

MSACompro: protein multiple sequence alignment using predicted secondary structure, solvent accessibility, and residue-residue contacts
Xin Deng ... Jianlin Cheng
BMC Bioinformatics | VOL. 12
Xin Deng, et. al.Xin Deng ... Jianlin Cheng
01 Dec 2011
BMC Bioinformatics | VOL. 12

Integration of Alignment and Phylogeny in the Whole-Genome Era

-

18 Jun 2015
18 Jun 2015

A novel fast multiple nucleotide sequence alignment method based on FM-index
Huan Liu ... Yun Xu
Briefings in Bioinformatics | VOL. 23
Huan Liu, et. al.Huan Liu ... Yun Xu
10 Dec 2021
Briefings in Bioinformatics | VOL. 23

PROMALS3D: a tool for multiple protein sequence and structure alignments
Jimin Pei ... Bong-Hyun Kim
Nucleic Acids Research | VOL. 36
Jimin Pei, et. al.Jimin Pei ... Bong-Hyun Kim
20 Feb 2008
Nucleic Acids Research | VOL. 36

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

MARS: improving multiple circular sequence alignment using refined sequences

Abstract

Highlights

Summary

Published Version

Talk to us

Similar Papers

More From: BMC Genomics