Multiple sequence alignment with user-defined anchor points

Burkhard Morgenstern, Peter F Stadler, Dirk Pöhler, Sonja J Prohaska

doi:10.1186/1748-7188-1-6

Open Access

https://doi.org/10.1186/1748-7188-1-6

Abstract

Background: Automated software tools for multiple alignment often fail to produce biologically meaningful results. In such situations, expert knowledge can help to improve the quality of alignments.

Results: Herein, we describe a semi-automatic version of the alignment program DIALIGN that can take pre-defined constraints into account. It is possible for the user to specify parts of the sequences that are assumed to be homologous and should therefore be aligned to each other. Our software program can use these sites as anchor points by creating a multiple alignment respecting these constraints. This way, our alignment method can produce alignments that are biologically more meaningful than alignments produced by fully automated procedures. As a demonstration of how our method works, we apply our approach to genomic sequences around the Hox gene cluster and to a set of DNA-binding proteins. As a by-product, we obtain insights about the performance of the greedy algorithm that our program uses for multiple alignment and about the underlying objective function. This information will be useful for the further development of DIALIGN. The described alignment approach has been integrated into the TRACKER software system.
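To make the idea of anchor points concrete, the following Python sketch shows one way user-defined anchors could be represented and filtered so that the retained set can coexist in a single alignment. Everything here is an illustrative assumption rather than DIALIGN's actual data structures or algorithm: the `Anchor` fields, the `compatible` test, and the score-ordered greedy filter in `select_anchors` are hypothetical names. The check is deliberately simplified: it only compares anchors that connect the same pair of sequences, and it rejects any overlapping anchors.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Anchor:
    """A user-defined anchor: a gap-free segment pair (hypothetical layout)."""
    seq_a: int    # index of the first sequence
    seq_b: int    # index of the second sequence
    start_a: int  # start position of the segment in seq_a
    start_b: int  # start position of the segment in seq_b
    length: int   # number of positions in the segment
    score: float  # user-assigned priority; higher wins on conflict


def precedes(x: Anchor, y: Anchor) -> bool:
    """True if x ends before y starts in BOTH sequences."""
    return (x.start_a + x.length <= y.start_a
            and x.start_b + x.length <= y.start_b)


def compatible(x: Anchor, y: Anchor) -> bool:
    """Simplified pairwise test: anchors on the same sequence pair must not cross."""
    if (x.seq_a, x.seq_b) != (y.seq_a, y.seq_b):
        return True  # assumption: different sequence pairs are not checked here
    return precedes(x, y) or precedes(y, x)


def select_anchors(anchors: list[Anchor]) -> list[Anchor]:
    """Greedily keep anchors in order of decreasing score, skipping conflicts."""
    accepted: list[Anchor] = []
    for candidate in sorted(anchors, key=lambda a: a.score, reverse=True):
        if all(compatible(candidate, kept) for kept in accepted):
            accepted.append(candidate)
    return accepted


# Example: the third anchor crosses the second one and is therefore dropped.
anchors = [
    Anchor(1, 2, 10, 12, 5, 4.0),
    Anchor(1, 2, 30, 20, 5, 3.0),
    Anchor(1, 2, 18, 40, 4, 2.0),
]
chosen = select_anchors(anchors)
```

A full consistency check for multiple alignment would also have to consider anchors that involve different sequence pairs, which this sketch does not attempt.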

Highlights

Automated software tools for multiple alignment often fail to produce biologically meaningful results
Multiple sequence alignment is a crucial prerequisite for biological sequence data analysis, and a large number of multi-alignment programs have been developed during the last twenty years
Most methods use a well-defined objective function that assigns a numerical quality score to every possible output alignment of an input sequence set, and they try to find an optimal or near-optimal alignment according to this objective function
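As an illustration of what an objective function over alignments can look like, the sketch below computes a sum-of-pairs score. It is a generic textbook-style example and is not presented as the objective function used by DIALIGN; the function name `sum_of_pairs` and the `match`, `mismatch`, and `gap` values are assumptions chosen for the example.

```python
from itertools import combinations


def sum_of_pairs(alignment, match=1, mismatch=-1, gap=-2):
    """Score an alignment given as equal-length strings with '-' for gaps."""
    if len({len(row) for row in alignment}) != 1:
        raise ValueError("all rows must have the same length")
    total = 0
    for column in zip(*alignment):          # walk the alignment column by column
        for a, b in combinations(column, 2):  # every pair of rows in the column
            if a == "-" and b == "-":
                continue                     # gap-gap pairs contribute nothing
            if a == "-" or b == "-":
                total += gap                 # residue aligned with a gap
            elif a == b:
                total += match
            else:
                total += mismatch
    return total


# Two candidate alignments of the same sequences can be compared by score.
score_a = sum_of_pairs(["ACGT", "ACGT"])
score_b = sum_of_pairs(["A-", "AC"])
```

An alignment program built on such a function would search for the alignment with the highest score, exactly or heuristically.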

Summary

Results

We describe a semi-automatic version of the alignment program DIALIGN that can take pre-defined constraints into account. The user can specify parts of the sequences that are assumed to be homologous, and our software program can use these sites as anchor points by creating a multiple alignment respecting these constraints. This way, our alignment method can produce alignments that are biologically more meaningful than alignments produced by fully automated procedures. As a by-product, we obtain insights about the performance of the greedy algorithm that our program uses for multiple alignment and about the underlying objective function. This information will be useful for the further development of DIALIGN. The described alignment approach has been integrated into the TRACKER software system.

Journal: Algorithms for Molecular Biology
Publication Date: Apr 19, 2006
Citations: 82
License type: CC BY 2.0
