Binding Site Graphs: A New Graph Theoretical Framework for Prediction of Transcription Factor Binding Sites

Timothy E Reddy,Charles Delisi,Boris E Shakhnovich

doi:10.1371/journal.pcbi.0030090

Timothy E Reddy, Charles Delisi + Show 1 more

Open Access

https://doi.org/10.1371/journal.pcbi.0030090

Copy DOI

Journal: PLoS Computational Biology	Publication Date: May 1, 2007
Citations: 65	License type: CC BY 4.0

Affiliation: Boston University, Harvard University

Abstract

Computational prediction of nucleotide binding specificity for transcription factors remains a fundamental and largely unsolved problem. Determination of binding positions is a prerequisite for research in gene regulation, a major mechanism controlling phenotypic diversity. Furthermore, an accurate determination of binding specificities from high-throughput data sources is necessary to realize the full potential of systems biology. Unfortunately, recently performed independent evaluation showed that more than half the predictions from most widely used algorithms are false. We introduce a graph-theoretical framework to describe local sequence similarity as the pair-wise distances between nucleotides in promoter sequences, and hypothesize that densely connected subgraphs are indicative of transcription factor binding sites. Using a well-established sampling algorithm coupled with simple clustering and scoring schemes, we identify sets of closely related nucleotides and test those for known TF binding activity. Using an independent benchmark, we find our algorithm predicts yeast binding motifs considerably better than currently available techniques and without manual curation. Importantly, we reduce the number of false positive predictions in yeast to less than 30%. We also develop a framework to evaluate the statistical significance of our motif predictions. We show that our approach is robust to the choice of input promoters, and thus can be used in the context of predicting binding positions from noisy experimental data. We apply our method to identify binding sites using data from genome scale ChIP–chip experiments. Results from these experiments are publicly available at http://cagt10.bu.edu/BSG. The graphical framework developed here may be useful when combining predictions from numerous computational and experimental measures. Finally, we discuss how our algorithm can be used to improve the sensitivity of computational predictions of transcription factor binding specificities.

Highlights

Transcription factors (TFs) bind short stretches of DNA near the gene’s transcription start site
A historically difficult problem in computational biology is the identification of transcription factor binding sites (TFBS) in the promoters of co-regulated genes
We introduce a fundamentally new approach to the identification of TFBS

Summary

Introduction

Transcription factors (TFs) bind short stretches (usually 6– 18 bp) of DNA near the gene’s transcription start site. This event is thought to facilitate regulation of expression of the downstream gene through TF interaction with the RNA polymerase and other factors in the pre-initiation complex [1]. The main in vivo approaches to TF binding site determination are variants of ChIP–chip assays, and DNA footprinting The former, which is essentially a high-throughput version of the latter, can identify approximate location of binding, usually accurate enough to within the length of a promoter [5,6]. Most computational algorithms depend on experimental assays to identify sets of co-regulated genes and work by recognizing over-represented, short stretches of DNA

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Binding Site Graphs: A New Graph Theoretical Framework for Prediction of Transcription Factor Binding Sites

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS Computational Biology

Lead the way for us

Similar Papers

Transcription Factor Binding Affinities and DNA Shape Readout.
Max Schnepf ... Christophe Jung
iScience | VOL. 23
Max Schnepf, et. al.Max Schnepf ... Christophe Jung
15 Oct 2020
iScience | VOL. 23

Decision letter: Promoter sequence and architecture determine expression variability and confer robustness to genetic variants
George H Perry
-
George H PerryGeorge H Perry
07 Sep 2022
07 Sep 2022

Author response: Promoter sequence and architecture determine expression variability and confer robustness to genetic variants
Hjörleifur Einarsson ... Marco Salvatore
-
Hjörleifur Einarsson, et. al.Hjörleifur Einarsson ... Marco Salvatore
03 Nov 2022
03 Nov 2022

DeePromoter: Robust Promoter Predictor Using Deep Learning.
Mhaned Oubounyt ... Zakaria Louadi
Frontiers in Genetics | VOL. 10
Mhaned Oubounyt, et. al.Mhaned Oubounyt ... Zakaria Louadi
05 Apr 2019
Frontiers in Genetics | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Binding Site Graphs: A New Graph Theoretical Framework for Prediction of Transcription Factor Binding Sites

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: PLoS Computational Biology