A Greedy Clustering Algorithm for Multiple Sequence Alignment

Rabah Lebsir,Tahi Fariza,Abdesslem Layeb

doi:10.4018/ijcini.20211001.oa41

Abstract

This paper presents a strategy to tackle the Multiple Sequence Alignment (MSA) problem, which is one of the most important tasks in the biological sequence analysis. Its role is to align the sequences in their entirety to derive relationships and common characteristics between a set of protein or nucleotide sequences. The MSA problem was proved to be an NP-Hard problem. The proposed strategy incorporates a new idea based on the well-known divide and conquer paradigm. This paper presents a novel method of clustering sequences as a preliminary step to improve the final alignment; this decomposition can be used as an optimization procedure with any MSA aligner to explore promising alignments of the search space. In their solution, authors proposed to align the clusters in a parallel and distributed way in order to benefit from parallel architectures. The strategy was tested using classical benchmarks like BAliBASE, Sabre, Prefab4 and Oxm, and the experimental results show that it gives good results by comparing to the other aligners.

Highlights

The multiple sequence alignment (MSA) consists to align more than two biological sequences like DNA or protein to bring out similar or homologous regions
This paper presents a novel method of clustering sequences as a preliminary step to improve the final alignment; this decomposition can be used as an optimization procedure with any MSA aligner to explore promising alignments of the search space
In this paper, a new strategy to tackle the MSA problem is developed based on the divide and conquer approach

Summary

Introduction

The multiple sequence alignment (MSA) consists to align more than two biological sequences like DNA or protein to bring out similar or homologous regions. MSA plays an important task in Bioinformatics and it is widely used like in protein analysis, identification of functional sites in genomic sequences, structural prediction, etc. Finding an optimal MSA has been demonstrated NP-hard (Wang & Jiang, 1994). MSA is an optimization problem, which exhibits a high time and space complexity. To solve this problem, several methods were proposed. They can be categorized into three classes (Notredame, 2002): exact methods, progressive methods and iterative methods

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Greedy Clustering Algorithm for Multiple Sequence Alignment

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Cognitive Informatics and Natural Intelligence

Lead the way for us

Journal: International Journal of Cognitive Informatics and Natural Intelligence	Publication Date: May 28, 2021
License type: CC BY 3.0

Similar Papers

A Greedy Clustering Algorithm for Multiple Sequence Alignment
-
International Journal of Cognitive Informatics and Natural Intelligence | VOL. 15
--
01 Oct 2021
International Journal of Cognitive Informatics and Natural Intelligence | VOL. 15

Protein multiple sequence alignment by hybrid bio-inspired algorithms.
Vincenzo Cutello ... Giuseppe Nicosia
Nucleic acids research | VOL. 39
Vincenzo Cutello, et. al.Vincenzo Cutello ... Giuseppe Nicosia
10 Nov 2010
Nucleic acids research | VOL. 39

A Quantum Evolutionary Algorithm for Effective Multiple Sequence Alignment
Souham Meshoul ... Mohamed Batouche
-
Souham Meshoul, et. al.Souham Meshoul ... Mohamed Batouche
01 Jan 2004
01 Jan 2004

Adaptation of the method of musical composition for solving the multiple sequence alignment problem
Roman Anselmo Mora-Gutiérrez ... Antonin Ponsich
Computing | VOL. 97
Roman Anselmo Mora-Gutiérrez, et. al.Roman Anselmo Mora-Gutiérrez ... Antonin Ponsich
13 Dec 2014
Computing | VOL. 97

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Greedy Clustering Algorithm for Multiple Sequence Alignment

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: International Journal of Cognitive Informatics and Natural Intelligence