Abstract

Sequence alignment is a significant facet in the bio-informatics research field for the molecular sequence analysis. Arrangement of two biological sequences by maximizing the similarities between the sequences by incorporating and adjusting gaps is Pairwise Sequence Alignment (PSA). Arrangement of multiple sequences is Multiple Sequence Alignment (MSA). Though Dynamic programming can produce optimal sequence alignment for PSA it suffers from a problem when multiple optimal paths are present and trace back is required. Back tracking becomes complex and it is also not suitable for MSA. So many meta-heuristic algorithms like Genetic Algorithm (GA) and Differential Evolutionary Algorithm (DE) are developed in the recent years to resolve the issue of optimization. Both GA and DE are used to produce optimal sequence alignment. But Compared to GA, DE is able to produce more optimal sequence alignment. To further enhance the performance of DE a new mutant is proposed by considering best, worst and a random candidate solution and applied on DE. It is named as New Differential Evolutionary Algorithm (NDE). By taking the test sequences from a bench mark data set “prefab4ref” tests are performed on GA, All DE mutants and NDE and it is observed that the proposed algorithm NDE outperformed all the other algorithms.

Highlights

  • Biological Informatics, in brief Bioinformatics is a combination of Biology, Computer Science and Information Technology

  • Alignment of two biological sequences is known as Pairwise Sequence Alignment (PSA) and alignment of three or more biological sequences is known as Multiple Sequence Alignment (MSA)

  • Scientists tried to use nature inspired optimization algorithms to solve the problem of sequence alignment

Read more

Summary

INTRODUCTION

Biological Informatics, in brief Bioinformatics is a combination of Biology, Computer Science and Information Technology. Another Scoring function called Column Score (CS) can be used, in which identical nucleotide bases are present in a single column a value of ‘1’ is assigned otherwise a value of ‘0’ is assigned For protein sequences another scoring function called Similarity Score (SS) can be used along with scoring functions like IS and CS. Pairwise Sequence Alignment by Differential Evolutionary Algorithm with New Mutation Strategy. Higgins developed a package called SAGA: sequence alignment by genetic algorithm [13] which is useful for finding globally optimal multiple alignments in reasonable time. Fitness Functions Many fitness functions or objective functions can be used to calculate the scores while performing sequence alignment They are Identity Score, Similarity Score and Column Score.

PAIRWISE SEQUENCE ALIGNMENT AND FITNESS FUNCTIONS
Alignment of Sequences
DE Mutant
Differential Evolutionary Algorithm
Mutation
RESULTS AND DISCUSSIONS
PSA Using Similarity Score
PSA Using Column Score
Ddiscussions
LIMITATIONS
CONCLUSIONS AND FUTURE ACCOMPLISHMENTS
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call