A hybrid algorithm for identifying partially conserved regions in multiple sequence alignment

Gamage Kokila Kasuni Perera,Champi Thusangi Wannige

doi:10.1080/1206212x.2019.1628468

Abstract

Multiple sequence alignment (MSA) algorithms are used to infer homologous regions in DNA and protein sequences which provide the basis for many microbiological studies. Center star method is an MSA algorithm with the ability to address a large-scale dataset, but it tends to produce poor results in the presence of multiple centers in the set of sequences. In such cases, partially conserved regions are often hidden in the alignment. We introduce an algorithm to address this problem based on Center star and progressive methods for MSA. In this algorithm, we first identify the subsets of sequences within the sequences by applying the Bisecting – kmeans algorithm using K-mers as the attributes for clustering. The center star method is performed separately on each subset of sequences. Finally, we merge these alignments by following a progressive alignment approach. An evaluation is carried out by using a set of DNA sequences from some HIV-1 infected patients with a known transmission chain. According to its results, the new algorithm produces output with better sum of pairs scores compared to center star methods and more accurate phylogeny could be generated using the resulting final alignment compared to the center star and progressive methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A hybrid algorithm for identifying partially conserved regions in multiple sequence alignment

Abstract

Talk to us

Similar Papers

More From: International Journal of Computers and Applications

Lead the way for us

Similar Papers

A hybrid algorithm for multiple DNA sequence alignment
Kokila K Perera ... C Thusangi Wannige
-
Kokila K Perera, et. al.Kokila K Perera ... C Thusangi Wannige
01 Sep 2016
01 Sep 2016

CSA: An efficient algorithm to improve circular DNA multiple alignment
Francisco Fernandes ... Luísa Pereira
BMC Bioinformatics | VOL. 10
Francisco Fernandes, et. al.Francisco Fernandes ... Luísa Pereira
23 Jul 2009
BMC Bioinformatics | VOL. 10

AlineaGA: A Genetic Algorithm for Multiple Sequence Alignment
Fernando José Mateus Da Silva ... Juan Antonio Gómez Pulido
-
Fernando José Mateus Da Silva, et. al.Fernando José Mateus Da Silva ... Juan Antonio Gómez Pulido
01 Jan 2008
01 Jan 2008

Benchmark of algorithms for multiple DNA sequence alignment across livestock species
Artur Bąk ... Chandra Shekhar Pareek
Translational Research in Veterinary Science | VOL. 3
Artur Bąk, et. al.Artur Bąk ... Chandra Shekhar Pareek
24 Jan 2021
Translational Research in Veterinary Science | VOL. 3

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A hybrid algorithm for identifying partially conserved regions in multiple sequence alignment

Abstract

Talk to us

Similar Papers

More From: International Journal of Computers and Applications