A proximity-based graph clustering method for the identification and application of transcription factor clusters

Maxwell Spadafore, Alan P Boyle, Kayvan Najarian

doi:10.1186/s12859-017-1935-y

Journal: BMC Bioinformatics
Publication Date: Nov 29, 2017
Citations: 1
License type: open-access

Affiliation: University of Michigan–Ann Arbor

Abstract

Background: Transcription factors (TFs) form a complex regulatory network within the cell that is crucial to cell functioning and human health. While methods to establish where a TF binds to DNA are well established, these methods provide no information describing how TFs interact with one another when they do bind. TFs tend to bind the genome in clusters, and current methods to identify these clusters are either limited in scope, unable to detect relationships beyond motif similarity, or not applied to TF-TF interactions.

Methods: Here, we present a proximity-based graph clustering approach to identify TF clusters using either ChIP-seq or motif search data. We use TF co-occurrence to construct a filtered, normalized adjacency matrix and use the Markov Clustering Algorithm to partition the graph while maintaining TF-cluster and cluster-cluster interactions. We then apply our graph structure beyond clustering, using it to increase the accuracy of motif-based TFBS searching for an example TF.

Results: We show that our method produces small, manageable clusters that encapsulate many known, experimentally validated transcription factor interactions and that our method is capable of capturing interactions that motif similarity methods might miss. Our graph structure is able to significantly increase the accuracy of motif TFBS searching, demonstrating that the TF-TF connections within the graph correlate with biological TF-TF interactions.

Conclusion: The interactions identified by our method correspond to biological reality and allow for fast exploration of TF clustering and regulatory dynamics.
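To make the pipeline described in the abstract more concrete, the following is a minimal sketch of proximity-based co-occurrence counting followed by Markov clustering. It is not the authors' implementation. The function names, the `window` and `min_count` parameters, the column normalization, the added self-loops, and the inflation value of 2.0 are all assumptions chosen for illustration; the abstract does not specify how the adjacency matrix is filtered or normalized.

```python
import numpy as np

def cooccurrence_matrix(sites, n_tfs, window, min_count=1):
    """Count TF pairs bound within `window` bp of each other (hypothetical helper).

    `sites` is a list of (chromosome, position, tf_index) tuples.
    Pairs seen fewer than `min_count` times are filtered out (an assumed filter).
    """
    counts = np.zeros((n_tfs, n_tfs))
    for i, (chrom_a, pos_a, tf_a) in enumerate(sites):
        for chrom_b, pos_b, tf_b in sites[i + 1:]:
            near = chrom_a == chrom_b and abs(pos_a - pos_b) <= window
            if near and tf_a != tf_b:
                counts[tf_a, tf_b] += 1
                counts[tf_b, tf_a] += 1
    counts[counts < min_count] = 0
    return counts

def markov_cluster(adjacency, inflation=2.0, max_iter=100, tol=1e-6):
    """Basic Markov Clustering: alternate expansion and inflation until stable."""
    # Self-loops keep every column sum positive and stabilize the iteration.
    m = adjacency + np.eye(len(adjacency))
    m = m / m.sum(axis=0, keepdims=True)      # column-stochastic matrix
    for _ in range(max_iter):
        prev = m
        m = m @ m                             # expansion: two-step random walk
        m = m ** inflation                    # inflation: favor strong flows
        m = m / m.sum(axis=0, keepdims=True)  # renormalize columns
        if np.abs(m - prev).max() < tol:
            break
    # Each row that still carries flow lists the members of one cluster.
    clusters = set()
    for row in m:
        members = frozenset(np.flatnonzero(row > 1e-6).tolist())
        if members:
            clusters.add(members)
    return sorted(sorted(c) for c in clusters)

# Toy data: TFs 0-2 bind close together on chr1, TFs 3-5 on chr2.
toy_sites = [
    ("chr1", 100, 0), ("chr1", 120, 1), ("chr1", 140, 2),
    ("chr2", 500, 3), ("chr2", 510, 4), ("chr2", 530, 5),
]
adjacency = cooccurrence_matrix(toy_sites, n_tfs=6, window=50)
print(markov_cluster(adjacency))  # prints [[0, 1, 2], [3, 4, 5]]
```

In this toy input the two groups share no co-occurrences, so the clustering simply recovers them. Real ChIP-seq or motif data would produce a much denser graph, where the inflation parameter controls how finely the graph is partitioned.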

Highlights

Transcription factors (TFs) form a complex regulatory network within the cell that is crucial to cell functioning and human health
Our method finds TF-TF interactions based on genomic colocation and focuses entirely on transcription factors, whereas STRING covers all protein-protein interactions and derives them from very diverse data sources; our method is therefore expected to produce many novel predictions when compared to STRING
For the ChIP-seq and Encyclopedia of DNA Elements (ENCODE)-motif datasets, we found that our method identified TF-TF interactions which were significantly (p < 0.05 and p < 0.001, respectively) more enriched in the Co-expression evidence category when compared to STRING interactions which were not predicted by our method

Summary

Introduction

Transcription factors (TFs) form a complex regulatory network within the cell that is crucial to cell functioning and human health. While methods to establish where a TF binds to DNA are well established, these methods provide no information describing how TFs interact with one another when they do bind. TFs tend to cooperatively bind the genome as large complexes, or clusters, binding to the DNA, one another, or both [10, 11]. In these situations, one or more “anchor” TFs bind the DNA directly, and other TFs bind the anchors rather than the DNA. This creates a combinatorial problem, wherein a given anchor TF may be bound by several different other TFs depending on time, cellular conditions, etc., and a given association (non-anchor) TF may bind
