Abstract

Multiple sequence alignment (MSA) represents a basic task for many bioinformatics applications. MSA allows finding common conserved regions among various sequences of proteins or DNA. However, to find the optimal multiple sequence alignment, it is necessary to design an efficient exploration approach that could explore a huge number of possible multiple sequence alignments. As well as, it is required to use a powerful evaluation method to assess the biological relevance of these multiple sequence alignment. To address these main problems, this article presents a multiobjective artificial fish swarm algorithm (MOAFS) to solve multiple sequence alignment. MOAFS uses the behaviors of artificial fish swarm algorithm such as the cooperation, decentralization and parallelism to ensure a good trade-off between the exploration and the exploitation of the search space of MSA problem. To preserve the quality and consistency of alignment, two fitness functions have been simultaneously used by the MOAFS algorithm: (i) Weighted Sum of Pairs to determine similar regions horizontally and (ii) Similarity function to determine vertically similar regions between the sequences of an alignment. Following the exploration of space search, the Pareto-optimal set is obtained by MOAFS which performs the optimal multiple sequence alignments for both fitness functions. The performance of MOAFS algorithm has been proved by comparing our algorithm with different progressive alignment methods, and other alignment methods based on evolutionary algorithms with single-objective and many-objective. The experiment results conducted on BAliBASE 2.0 and BAliBASE 3.0 benchmark confirm that the MOAFS algorithm provides a greater accuracy statistical significance in terms of SP or CS scores.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call