Graph search and variable neighborhood search for finding constrained longest common subsequences in artificial and real gene sequences

Marko Djukanović,Aleksandar Kartelj,Dragan Matić,Milana Grbić,Christian Blum,Günther R Raidl

doi:10.1016/j.asoc.2022.108844

Abstract

We consider the constrained longest common subsequence problem with an arbitrary set of input strings as well as an arbitrary set of pattern strings. This problem has applications, for example, in computational biology where it serves as a measure of similarity for sets of molecules with putative structures in common. We contribute in several ways. First, it is formally proven that finding a feasible solution of arbitrary length is, in general, NP-complete. Second, we propose several heuristic approaches: a greedy algorithm, a beam search aiming for feasibility, a variable neighborhood search, and a hybrid of the latter two approaches. An exhaustive experimental study shows the effectivity and differences of the proposed approaches in respect to finding a feasible solution, finding high-quality solutions, and runtime for both, artificial and real-world instance sets. The latter ones are generated from a set of 12681 bacteria 16S rRNA gene sequences and consider 15 primer contigs as pattern strings.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Soft Computing	Publication Date: Apr 18, 2022
Citations: 4	License type: other-oa

R Discovery Prime

R Discovery Prime

Graph search and variable neighborhood search for finding constrained longest common subsequences in artificial and real gene sequences

Abstract

Talk to us

Similar Papers

More From: Applied Soft Computing

Lead the way for us

Similar Papers

Nonlinear Response Potential of Mainshock-Aftershock Sequences from Japanese Earthquakes
K Goda
Bulletin of the Seismological Society of America | VOL. 102
K GodaK Goda
01 Oct 2012
Bulletin of the Seismological Society of America | VOL. 102

On Solving a Generalized Constrained Longest Common Subsequence Problem
Marko Djukanovic ... Christian Blum
-
Marko Djukanovic, et. al.Marko Djukanovic ... Christian Blum
01 Jan 2020
01 Jan 2020

Longest common substring in Longest Common Subsequence’s solution service: A novel hyper-heuristic
Alireza Abdi ... Mohsen Hooshmand
Computational Biology and Chemistry | VOL. 105
Alireza Abdi, et. al.Alireza Abdi ... Mohsen Hooshmand
19 May 2023
Computational Biology and Chemistry | VOL. 105

Full-Text Indexes in External Memory
Juha Kärkkäinen ... S Srinivasa Rao
-
Juha Kärkkäinen, et. al.Juha Kärkkäinen ... S Srinivasa Rao
01 Jan 2003
01 Jan 2003

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Graph search and variable neighborhood search for finding constrained longest common subsequences in artificial and real gene sequences

Abstract

Talk to us

Similar Papers

More From: Applied Soft Computing