A Novel Alignment-Free Method for Comparing Transcription Factor Binding Site Motifs

Minli Xu,Zhengchang Su,Vladimir Brusic

doi:10.1371/journal.pone.0008797

Minli Xu, Zhengchang Su + Show 1 more

Open Access

https://doi.org/10.1371/journal.pone.0008797

Copy DOI

Journal: PLoS ONE	Publication Date: Jan 20, 2010
Citations: 50	License type: CC BY 4.0

Affiliation: University of North Carolina at Charlotte

Abstract

BackgroundTranscription factor binding site (TFBS) motifs can be accurately represented by position frequency matrices (PFM) or other equivalent forms. We often need to compare TFBS motifs using their PFMs in order to search for similar motifs in a motif database, or cluster motifs according to their binding preference. The majority of current methods for motif comparison involve a similarity metric for column-to-column comparison and a method to find the optimal position alignment between the two compared motifs. In some applications, alignment-free methods might be preferred; however, few such methods with high accuracy have been described.Methodology/Principal FindingsHere we describe a novel alignment-free method for quantifying the similarity of motifs using their PFMs by converting PFMs into k-mer vectors. The motifs could then be compared by measuring the similarity among their corresponding k-mer vectors.Conclusions/SignificanceWe demonstrate that our method in general achieves similar performance or outperforms the existing methods for clustering motifs according to their binding preference and identifying similar motifs of transcription factors of the same family.

Highlights

Transcription factors (TFs) play important roles in the regulation of gene transcription through binding to specific DNA sequences called TF binding sites (TFBSs), which are usually 5–25 bp in length [1,2]
A TFBS motif is often represented by a position frequency matrix (PFM), which consists of nucleotide frequencies at each position of the motif [3]
We evaluated our algorithm for identifying the TFBS motifs of structural and/or evolutionarily related TFs using all three datasets by the ‘‘best-hit’’ approach used in Mahony et al [8]

Summary

Introduction

Transcription factors (TFs) play important roles in the regulation of gene transcription through binding to specific DNA sequences called TF binding sites (TFBSs), which are usually 5–25 bp in length [1,2]. A PFM is derived from the alignment of known TFBSs of the TF, and it largely reflects the TF’s DNA binding preference at each position. In genome-scale TFBS prediction applications, redundant and sub motifs of the same TFs are often returned by motif finders, and they need to be clustered to form unique motifs [9,10]. In all these applications, the similarity between two motifs needs to be accurately calculated for the desired purposes. Transcription factor binding site (TFBS) motifs can be accurately represented by position frequency matrices (PFM) or other equivalent forms. Alignment-free methods might be preferred; few such methods with high accuracy have been described

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Novel Alignment-Free Method for Comparing Transcription Factor Binding Site Motifs

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS ONE

Lead the way for us

Similar Papers

MCOIN: a novel heuristic for determining TFBS motif width
...
-
, et. al. ...
18 Jun 2013
18 Jun 2013

Meta-analysis discovery of tissue-specific DNA sequence motifs from mammalian gene expression data
Bertrand R Huber ... Martha L Bulyk
BMC Bioinformatics | VOL. 7
Bertrand R Huber, et. al.Bertrand R Huber ... Martha L Bulyk
27 Apr 2006
BMC Bioinformatics | VOL. 7

SPIC: A novel similarity metric for comparing transcription factor binding site motifs based on information contents
Shaoqiang Zhang ... Chuanbin Du
BMC Systems Biology | VOL. 7
Shaoqiang Zhang, et. al.Shaoqiang Zhang ... Chuanbin Du
01 Jan 2013
BMC Systems Biology | VOL. 7

Stochastic EM-based TFBS motif discovery with MITSU
Alastair M Kilpatrick ... Bruce Ward
Bioinformatics | VOL. 30
Alastair M Kilpatrick, et. al.Alastair M Kilpatrick ... Bruce Ward
11 Jun 2014
Bioinformatics | VOL. 30

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Novel Alignment-Free Method for Comparing Transcription Factor Binding Site Motifs

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS ONE