Comparative analysis of the quality of a global algorithm and a local algorithm for alignment of two sequences.

Valery O Polyanovsky, Vladimir G Tumanyan, Mikhail A Roytberg

doi:10.1186/1748-7188-6-25

Open Access

https://doi.org/10.1186/1748-7188-6-25

Abstract

Background: Sequence alignment algorithms are key instruments for computer-assisted studies of biopolymers. It is important to take into account the "quality" of the obtained alignments, i.e. how closely the algorithms restore the "gold standard" alignment (GS-alignment), which superimposes positions originating from the same position in the common ancestor of the compared sequences. A 3D-alignment is commonly used as an approximation of the GS-alignment, although this choice is not fully justified. Among the currently used pair-wise alignment algorithms, the best quality is achieved by the optimal alignment algorithm based on affine penalties for deletions (the Smith-Waterman algorithm). Nevertheless, the expediency of using the local or the global version of the algorithm has not been studied.

Results: Using model series of amino acid sequence pairs, we studied the relative "quality" of results produced by local and global alignments as a function of (1) the relative lengths of the similar parts of the sequences (their "cores") and their nonhomologous parts, and (2) the relative positions of the core regions in the compared sequences. We obtained numerical values of the average quality (measured as accuracy and confidence) of the global and the local alignment methods for evolutionary distances between homologous sequence parts from 30 to 240 PAM, for core lengths from 10% to 70% of the total sequence length, and for all possible positions of the homologous sequence parts relative to the centers of the sequences.

Conclusion: We identified criteria that specify the conditions under which the local or the global alignment algorithm is preferable, depending on the positions and relative lengths of the cores and nonhomologous parts of the sequences to be aligned. When the core of one sequence was positioned above the core of the other sequence, the global algorithm was more stable than the local algorithm at longer evolutionary distances and larger nonhomologous parts. Conversely, when the cores were positioned asymmetrically, the local algorithm was more stable than the global algorithm at longer evolutionary distances and larger nonhomologous parts. This opens the possibility of creating a combined method that generates more accurate alignments.
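The abstract names accuracy and confidence as its two quality measures but does not define them. The sketch below assumes one common reading: accuracy is the share of GS-alignment pairs that the algorithm recovers, and confidence is the share of the algorithm's pairs that also occur in the GS-alignment. The function name `alignment_quality` and the pair representation are hypothetical and chosen only for illustration.

```python
def alignment_quality(predicted_pairs, reference_pairs):
    """Return (accuracy, confidence) for a predicted alignment.

    Each argument is an iterable of (i, j) pairs, meaning that position i of
    the first sequence is aligned with position j of the second sequence.
    The definitions used here are an assumption, not taken from the abstract.
    """
    predicted = set(predicted_pairs)
    reference = set(reference_pairs)
    shared = predicted & reference  # pairs the algorithm got right

    # Accuracy: fraction of the reference (GS) pairs that were recovered.
    accuracy = len(shared) / len(reference) if reference else 0.0
    # Confidence: fraction of the predicted pairs that are correct.
    confidence = len(shared) / len(predicted) if predicted else 0.0
    return accuracy, confidence


# Toy example: four reference pairs, three predicted pairs, two in common.
reference = [(0, 0), (1, 1), (2, 2), (3, 3)]
predicted = [(1, 1), (2, 2), (3, 4)]
acc, conf = alignment_quality(predicted, reference)
```

Under these assumed definitions, an alignment that reports only a few pairs can have high confidence and low accuracy, which is why the two values are reported together.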

Highlights

Sequence alignment algorithms are key instruments for computer-assisted studies of biopolymers.
Vingron and Argos [11] showed that the regions of the optimal alignment that recur most frequently in suboptimal alignments are very similar to the alignments produced by structural alignment methods.
We suggest comparing artificially generated sequences to evaluate the quality of alignment algorithms, because the GS-alignment for such sequences is known from the outset.
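The idea of artificially generated test pairs can be sketched as follows. This is a strongly simplified illustration, not the authors' generator: it assumes substitutions only (no insertions or deletions), draws replacements uniformly instead of following a PAM model, and uses hypothetical names (`make_pair`, `sub_rate`). Its purpose is to show why the reference alignment is known by construction.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def random_seq(length, rng):
    """Return a random amino acid string of the given length."""
    return "".join(rng.choice(AMINO_ACIDS) for _ in range(length))


def make_pair(core_len, left1, right1, left2, right2, sub_rate, rng):
    """Build two sequences that share a mutated core between random flanks.

    Returns (seq1, seq2, reference), where reference lists the (i, j)
    position pairs that descend from the same core position.
    """
    core1 = random_seq(core_len, rng)
    # Substitutions only (assumption), so core position k stays at offset k.
    core2 = "".join(
        rng.choice(AMINO_ACIDS) if rng.random() < sub_rate else residue
        for residue in core1
    )
    seq1 = random_seq(left1, rng) + core1 + random_seq(right1, rng)
    seq2 = random_seq(left2, rng) + core2 + random_seq(right2, rng)
    reference = [(left1 + k, left2 + k) for k in range(core_len)]
    return seq1, seq2, reference


# Asymmetric placement: the core sits mid-sequence in seq1, at the start of seq2.
rng = random.Random(0)
seq1, seq2, reference = make_pair(10, 5, 5, 0, 10, 0.3, rng)
```

Varying the flank lengths changes both the share of the core in the total length and the relative position of the cores, which are the two factors the study examines.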

Summary

Introduction

Sequence alignment algorithms are key instruments for computer-assisted studies of biopolymers, and pair-wise alignment of amino acid sequences is the main method of comparative protein analysis. Among the currently used pair-wise alignment algorithms, the best quality is achieved by the optimal alignment algorithm based on affine penalties for deletions (the Smith-Waterman algorithm). Vingron and Argos [11] demonstrated a relationship between how well a region of the optimal global alignment is conserved across a set of suboptimal alignments and how similar that region is to the structural alignment. They showed that the regions of the optimal alignment that recur most frequently in suboptimal alignments are very similar to the alignments produced by structural alignment methods.
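To make the difference between the global and local versions concrete, here is a minimal score-only sketch of optimal alignment with affine gap penalties. It is not the authors' implementation: the function name `align_score`, the toy match/mismatch scores (standing in for a substitution matrix), and the gap values are all assumptions chosen for illustration.

```python
NEG_INF = float("-inf")


def align_score(a, b, local=False, match=2, mismatch=-1,
                gap_open=-5, gap_extend=-1):
    """Best alignment score of a and b with affine gap penalties.

    A gap of length k costs gap_open + (k - 1) * gap_extend.
    local=False scores an end-to-end (global) alignment;
    local=True scores the best pair of substrings (local alignment).
    """
    n, m = len(a), len(b)
    H = [[0] * (m + 1) for _ in range(n + 1)]        # best score at (i, j)
    E = [[NEG_INF] * (m + 1) for _ in range(n + 1)]  # ends with a gap in a
    F = [[NEG_INF] * (m + 1) for _ in range(n + 1)]  # ends with a gap in b
    if not local:
        # A global alignment must pay for leading gaps.
        for j in range(1, m + 1):
            H[0][j] = E[0][j] = gap_open + (j - 1) * gap_extend
        for i in range(1, n + 1):
            H[i][0] = F[i][0] = gap_open + (i - 1) * gap_extend
    best = 0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            E[i][j] = max(E[i][j - 1] + gap_extend, H[i][j - 1] + gap_open)
            F[i][j] = max(F[i - 1][j] + gap_extend, H[i - 1][j] + gap_open)
            pair = match if a[i - 1] == b[j - 1] else mismatch
            H[i][j] = max(H[i - 1][j - 1] + pair, E[i][j], F[i][j])
            if local:
                # A local alignment may restart anywhere at score 0.
                H[i][j] = max(H[i][j], 0)
                best = max(best, H[i][j])
    return best if local else H[n][m]


# The shared segment "ACGT" sits at opposite ends of the two sequences.
seq_a, seq_b = "TTTTACGT", "ACGTCCCC"
global_score = align_score(seq_a, seq_b)
local_score = align_score(seq_a, seq_b, local=True)
```

In this toy case the shared segment is placed asymmetrically, so the local score reflects the four matching positions while the global score is pulled down by the unrelated flanks it is forced to align.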

Journal: Algorithms for Molecular Biology
Publication Date: Oct 27, 2011
Citations: 60
License type: CC BY
