Theoretical and empirical quality assessment of transcription factor-binding motifs

Alejandra Medina-Rivera,Heladia Salgado,Jacques Van Helden,Morgane Thomas-Chollier,Cei Abreu-Goodger,Julio Collado-Vides

doi:10.1093/nar/gkq710

Abstract

Position-specific scoring matrices (PSSMs) are routinely used to predict transcription factor (TF)-binding sites in genome sequences. However, their reliability to predict novel binding sites can be far from optimum, due to the use of a small number of training sites or the inappropriate choice of parameters when building the matrix or when scanning sequences with it. Measures of matrix quality such as E-value and information content rely on theoretical models, and may fail in the context of full genome sequences. We propose a method, implemented in the program ‘matrix-quality’, that combines theoretical and empirical score distributions to assess reliability of PSSMs for predicting TF-binding sites. We applied ‘matrix-quality’ to estimate the predictive capacity of matrices for bacterial, yeast and mouse TFs. The evaluation of matrices from RegulonDB revealed some poorly predictive motifs, and allowed us to quantify the improvements obtained by applying multi-genome motif discovery. Interestingly, the method reveals differences between global and specific regulators. It also highlights the enrichment of binding sites in sequence sets obtained from high-throughput ChIP-chip (bacterial and yeast TFs), and ChIP–seq and experiments (mouse TFs). The method presented here has many applications, including: selecting reliable motifs before scanning sequences; improving motif collections in TFs databases; evaluating motifs discovered using high-throughput data sets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Nucleic Acids Research	Publication Date: Oct 4, 2010
Citations: 63	License type: CC BY-NC 2.5

R Discovery Prime

R Discovery Prime

Theoretical and empirical quality assessment of transcription factor-binding motifs

Abstract

Talk to us

Similar Papers

More From: Nucleic Acids Research

Lead the way for us

Similar Papers

High resolution models of transcription factor-DNA affinities improve in vitro and in vivo binding predictions.
Phaedra Agius ... Aaron Arvey
PLoS computational biology | VOL. 6
Phaedra Agius, et. al.Phaedra Agius ... Aaron Arvey
09 Sep 2010
PLoS computational biology | VOL. 6

Guided deletion and mutagenesis analysis identified a tMEK2-responsive region in tomato lepr1b1 promoter
T Xing ... B.L Miki
Canadian Journal of Plant Pathology | VOL. 25
T Xing, et. al.T Xing ... B.L Miki
01 Jan 2003
Canadian Journal of Plant Pathology | VOL. 25

Integrating genomic data to predict transcription factor binding.
Dustin T Holloway ... Mark Kon
Genome Informatics | VOL. 16
Dustin T Holloway, et. al.Dustin T Holloway ... Mark Kon
01 Jan 2004
Genome Informatics | VOL. 16

Core transcriptional regulatory circuitry in human hepatocytes
Duncan T Odom ... Graeme I Bell
Molecular Systems Biology | VOL. 2
Duncan T Odom, et. al.Duncan T Odom ... Graeme I Bell
01 Jan 2006
Molecular Systems Biology | VOL. 2

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Theoretical and empirical quality assessment of transcription factor-binding motifs

Abstract

Talk to us

Similar Papers

More From: Nucleic Acids Research