Greedy mixture learning for multiple motif discovery in biological sequences

Konstantinos Blekas, Dimitrios I Fotiadis, Aristidis Likas

doi:10.1093/bioinformatics/btg037

Abstract

This paper studies the problem of discovering subsequences, known as motifs, that are common to a given collection of related biosequences. It proposes a greedy algorithm that learns a mixture-of-motifs model through likelihood maximization. The approach sequentially adds a new motif to the mixture model, using a combined scheme of global and local search to initialize the new motif's parameters appropriately. In addition, a hierarchical partitioning scheme based on kd-trees is presented for partitioning the input dataset in order to speed up the global search procedure. The proposed method compares favorably with the well-known MEME approach and successfully addresses several of its drawbacks. Experimental results indicate that the algorithm is advantageous in identifying larger groups of motifs characteristic of biological families with significant conservation. In addition, it offers better diagnostic capabilities by building more powerful statistical motif models with improved classification accuracy.
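
The greedy scheme summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and it makes several simplifying assumptions that the abstract does not state: motifs are modeled as fixed-width, ungapped position weight matrices over the DNA alphabet ACGT; the first component is a uniform "background"; the global search scores every distinct substring as a seed rather than using the kd-tree partitioning; and the local search is plain EM with a small pseudo-count. All function names and parameters (greedy_mixture, pwm_from_seed, new_weight, and so on) are hypothetical.

```python
import math

ALPHABET = "ACGT"  # assumption: DNA input containing only these letters

def windows(seqs, w):
    """All overlapping length-w substrings of the input sequences."""
    return [s[i:i + w] for s in seqs for i in range(len(s) - w + 1)]

def pwm_from_seed(seed, strength=0.7):
    """Initialize a position weight matrix peaked on a seed substring."""
    rest = (1.0 - strength) / (len(ALPHABET) - 1)
    return [{a: (strength if a == c else rest) for a in ALPHABET} for c in seed]

def component_prob(x, pwm):
    """Probability of substring x under one product-multinomial component."""
    p = 1.0
    for col, c in zip(pwm, x):
        p *= col[c]
    return p

def log_likelihood(data, weights, pwms):
    """Log-likelihood of all substrings under the current mixture."""
    return sum(
        math.log(sum(pi * component_prob(x, m) for pi, m in zip(weights, pwms)))
        for x in data
    )

def em(data, weights, pwms, iters=20, pseudo=0.01):
    """Plain EM for a mixture of product-multinomial components."""
    for _ in range(iters):
        # E-step: posterior responsibility of each component for each substring.
        resp = []
        for x in data:
            joint = [pi * component_prob(x, m) for pi, m in zip(weights, pwms)]
            z = sum(joint)
            resp.append([j / z for j in joint])
        # M-step: re-estimate mixing weights and per-position letter frequencies.
        weights = [sum(r[k] for r in resp) / len(data) for k in range(len(pwms))]
        new_pwms = []
        for k in range(len(pwms)):
            cols = []
            for j in range(len(pwms[k])):
                counts = {a: pseudo for a in ALPHABET}
                for x, r in zip(data, resp):
                    counts[x[j]] += r[k]
                norm = sum(counts.values())
                cols.append({a: counts[a] / norm for a in ALPHABET})
            new_pwms.append(cols)
        pwms = new_pwms
    return weights, pwms

def greedy_mixture(seqs, w, n_motifs, new_weight=0.1):
    """Add motif components one at a time, keeping the earlier ones."""
    data = windows(seqs, w)
    # Start from a single uniform background component.
    weights = [1.0]
    pwms = [[{a: 1.0 / len(ALPHABET) for a in ALPHABET} for _ in range(w)]]
    for _ in range(n_motifs):
        shrunk = [pi * (1.0 - new_weight) for pi in weights] + [new_weight]

        # Global search: score every distinct substring as a seed.
        def score(seed):
            return log_likelihood(data, shrunk, pwms + [pwm_from_seed(seed)])

        best = max(sorted(set(data)), key=score)
        # Local search: refine the enlarged mixture with EM from the best seed.
        weights, pwms = em(data, shrunk, pwms + [pwm_from_seed(best)])
    return weights, pwms

def consensus(pwm):
    """Most probable letter at each position of a weight matrix."""
    return "".join(max(ALPHABET, key=col.get) for col in pwm)
```

Calling greedy_mixture(seqs, w=5, n_motifs=2) on a list of DNA strings returns the mixing weights and weight matrices for the background plus two motif components, and consensus can be applied to each matrix to read off its dominant pattern. Scoring every substring as a seed is quadratic in the number of substrings, which is the cost the paper's kd-tree partitioning is meant to reduce.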

Journal: Bioinformatics
Publication Date: Mar 22, 2003
Citations: 64

Similar Papers

A Sequential Method for Discovering Probabilistic Motifs in Proteins
A Likas ... K Blekas
Methods of Information in Medicine | VOL. 43
01 Jan 2004

Global2Local: Efficient Structure Search for Video Action Segmentation
Shang-Hua Gao ... Liang Wang
01 Jun 2021

RF-Next: Efficient Receptive Field Search for Convolutional Neural Networks.
Shanghua Gao ... Qi Han
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. 45
01 Jan 2021

Incremental Mixture Learning for Clustering Discrete Data
Konstantinos Blekas ... Aristidis Likas
01 Jan 2004
