MAGUS: Multiple sequence Alignment using Graph clUStering.

Vladimir Smirnov, Tandy Warnow

doi:10.1093/bioinformatics/btaa992

Open Access

Journal: Bioinformatics
Publication Date: Nov 30, 2020
Citations: 49
License type: CC BY-NC 4.0

Affiliation: University of Illinois Urbana-Champaign

Abstract

Motivation: The estimation of large multiple sequence alignments (MSAs) is a basic bioinformatics challenge. Divide-and-conquer is a useful approach that has been shown to improve the scalability and accuracy of MSA estimation in established methods such as SATé and PASTA. In these divide-and-conquer strategies, a sequence dataset is divided into disjoint subsets, alignments are computed on the subsets using base MSA methods (e.g. MAFFT), and then merged together into an alignment on the full dataset.

Results: We present MAGUS, Multiple sequence Alignment using Graph clUStering, a new technique for computing large-scale alignments. MAGUS is similar to PASTA in that it uses nearly the same initial steps (starting tree, similar decomposition strategy, and MAFFT to compute subset alignments), but then merges the subset alignments using the Graph Clustering Merger, a new method for combining disjoint alignments that we present in this study. Our study, on a heterogeneous collection of biological and simulated datasets, shows that MAGUS produces improved accuracy and is faster than PASTA on large datasets, and matches it on smaller datasets.

Availability and implementation: MAGUS is available at https://github.com/vlasmirnov/MAGUS

Supplementary information: Supplementary data are available at Bioinformatics online.
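To make the phrase "combining disjoint alignments" concrete, the following is a minimal Python sketch of what a merge has to produce: one alignment over all sequences that still contains each subset alignment unchanged. It is not the Graph Clustering Merger and is not taken from the MAGUS code. The names `merge_alignments`, `restrict`, and `clusters` are hypothetical, and the grouping of subset-alignment columns into merged columns is written by hand here, since the abstract does not describe how that grouping is computed. The sketch also assumes that no subset alignment contains an all-gap column.

```python
from typing import Dict, List

# Hypothetical representation: sequence name -> gapped row (rows of equal length).
Alignment = Dict[str, str]


def merge_alignments(subalignments: List[Alignment],
                     clusters: List[Dict[int, int]]) -> Alignment:
    """Merge disjoint subset alignments into a single alignment.

    clusters[k] maps a subalignment index to the column of that subalignment
    that is placed in merged column k. A subalignment that does not appear in
    clusters[k] receives a gap in merged column k.
    """
    # Every column of every subalignment must appear exactly once, in order.
    for i, sub in enumerate(subalignments):
        width = len(next(iter(sub.values())))
        used = [c[i] for c in clusters if i in c]
        if used != list(range(width)):
            raise ValueError(f"columns of subalignment {i} missing or out of order")

    merged: Alignment = {}
    for i, sub in enumerate(subalignments):
        for name, row in sub.items():
            merged[name] = "".join(row[c[i]] if i in c else "-" for c in clusters)
    return merged


def restrict(alignment: Alignment, names: List[str]) -> Alignment:
    """Keep only the given rows and drop columns that are all gaps."""
    rows = {n: alignment[n] for n in names}
    width = len(next(iter(rows.values())))
    keep = [k for k in range(width) if any(r[k] != "-" for r in rows.values())]
    return {n: "".join(r[k] for k in keep) for n, r in rows.items()}


# Toy input: two disjoint subset alignments (made-up sequences).
sub_a = {"s1": "AC-GT", "s2": "ACCGT"}
sub_b = {"s3": "AGT", "s4": "A-T"}

# Hand-written column grouping (an assumption for illustration only).
clusters = [{0: 0, 1: 0}, {0: 1}, {0: 2}, {0: 3, 1: 1}, {0: 4, 1: 2}]

merged = merge_alignments([sub_a, sub_b], clusters)
for name, row in merged.items():
    print(name, row)
```

Restricting `merged` to the sequences of either subset with `restrict` returns that subset's input alignment, which is the property any merger of disjoint alignments is expected to preserve. Merging methods differ in how they choose the column grouping.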
