STREME: accurate and versatile sequence motif discovery.

Timothy L Bailey

doi:10.1093/bioinformatics/btab203

Abstract

Sequence motif discovery algorithms can identify novel sequence patterns that perform biological functions in DNA, RNA and protein sequences-for example, the binding site motifs of DNA- and RNA-binding proteins. The STREME algorithm presented here advances the state-of-the-art in ab initio motif discovery in terms of both accuracy and versatility. Using in vivo DNA (ChIP-seq) and RNA (CLIP-seq) data, and validating motifs with reference motifs derived from in vitro data, we show that STREME is more accurate, sensitive and thorough than several widely used algorithms (DREME, HOMER, MEME, Peak-motifs) and two other representative algorithms (ProSampler and Weeder). STREME's capabilities include the ability to find motifs in datasets with hundreds of thousands of sequences, to find both short and long motifs (from 3 to 30 positions), to perform differential motif discovery in pairs of sequence datasets, and to find motifs in sequences over virtually any alphabet (DNA, RNA, protein and user-defined alphabets). Unlike most motif discovery algorithms, STREME reports a useful estimate of the statistical significance of each motif it discovers. STREME is easy to use individually via its web server or via the command line, and is completely integrated with the widely used MEME Suite of sequence analysis tools. The name STREME stands for 'Simple, Thorough, Rapid, Enriched Motif Elicitation'. The STREME web server and source code are provided freely for non-commercial use at http://meme-suite.org. Supplementary data are available at Bioinformatics online.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

STREME: accurate and versatile sequence motif discovery.

Abstract

Talk to us

Similar Papers

More From: Bioinformatics

Lead the way for us

Journal: Bioinformatics	Publication Date: Mar 24, 2021
Citations: 334

Similar Papers

A Sequential Monte Carlo Method for Motif Discovery
...
-
, et. al. ...
04 Sep 2006
04 Sep 2006

Guiding motif discovery by iterative pattern refinement
Zhiping Wang ... Sun Kim
-
Zhiping Wang, et. al.Zhiping Wang ... Sun Kim
14 Mar 2004
14 Mar 2004

Leveraging cross-link modification events in CLIP-seq for motif discovery.
Emad Bahrami-Samani ... Philip J Uren
Nucleic Acids Research | VOL. 43
Emad Bahrami-Samani, et. al.Emad Bahrami-Samani ... Philip J Uren
10 Dec 2014
Nucleic Acids Research | VOL. 43

A Sequential Monte Carlo Method for Motif Discovery
Kuo-Ching Liang ... Xiaodong Wang
IEEE Transactions on Signal Processing | VOL. 56
Kuo-Ching Liang, et. al.Kuo-Ching Liang ... Xiaodong Wang
01 Sep 2008
IEEE Transactions on Signal Processing | VOL. 56

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

STREME: accurate and versatile sequence motif discovery.

Abstract

Talk to us

Similar Papers

More From: Bioinformatics