Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles

Jeremiah J Faith,Guillaume Cottarel,Simon Kasif,Jamey Wierzbowski,Joshua T Thaden,Timothy S Gardner,Boris Hayete,James J Collins,Ilaria Mogno

doi:10.1371/journal.pbio.0050008

Abstract

Machine learning approaches offer the potential to systematically identify transcriptional regulatory interactions from a compendium of microarray expression profiles. However, experimental validation of the performance of these methods at the genome scale has remained elusive. Here we assess the global performance of four existing classes of inference algorithms using 445 Escherichia coli Affymetrix arrays and 3,216 known E. coli regulatory interactions from RegulonDB. We also developed and applied the context likelihood of relatedness (CLR) algorithm, a novel extension of the relevance networks class of algorithms. CLR demonstrates an average precision gain of 36% relative to the next-best performing algorithm. At a 60% true positive rate, CLR identifies 1,079 regulatory interactions, of which 338 were in the previously known network and 741 were novel predictions. We tested the predicted interactions for three transcription factors with chromatin immunoprecipitation, confirming 21 novel interactions and verifying our RegulonDB-based performance estimates. CLR also identified a regulatory link providing central metabolic control of iron transport, which we confirmed with real-time quantitative PCR. The compendium of expression data compiled in this study, coupled with RegulonDB, provides a valuable model system for further improvement of network inference algorithms using experimental data.

Highlights

High-throughput genome sequencing and bioinformatics technologies have dramatically eased the task of genomic annotation, producing parts lists of living organisms as simple as Mycoplasmas and as complex as mammals
The total space of possible transcriptional regulatory interactions for an organism is the number of transcription factors multiplied by the number of genes multiplied by the number of environmental contexts in which the cell might find itself
Organisms can adapt to changing environments—becoming more virulent, for example, or activating stress responses—thanks to a flexible gene expression program controlled by the dynamic interactions of hundreds of transcriptional regulators

Summary

Introduction

High-throughput genome sequencing and bioinformatics technologies have dramatically eased the task of genomic annotation, producing parts lists of living organisms as simple as Mycoplasmas and as complex as mammals. Pioneering efforts to identify regulatory interactions on a genome scale have used machine-learning algorithms to identify cis-regulatory motifs or transcription factor target genes using a large set of expression arrays [2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18], genomewide location analysis chromatin immunoprecipitation (ChIP-Chip) [19,20], or a combination of these and other high-throughput methods [21,22,23,24,25,26]. We demonstrate an unsupervised network inference method, context likelihood of relatedness (CLR), which uses transcriptional profiles of an organism across a diverse set of conditions to systematically determine transcriptional regulatory interactions.

Author Summary

Findings

Materials and Methods

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PLoS Biology	Publication Date: Jan 1, 2007
Citations: 1525	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: PLoS Biology

Lead the way for us

Similar Papers

Reconstruction of dynamic regulatory networks reveals signaling-induced topology changes associated with germ layer specification.
Emily Y Su ... Patrick Cahan
Stem Cell Reports | VOL. 17
Emily Y Su, et. al.Emily Y Su ... Patrick Cahan
27 Jan 2022
Stem Cell Reports | VOL. 17

Transcriptome Analysis Identifies Regulators of Hematopoietic Stem and Progenitor Cells
Roi Gazit ... Amy J Wagers
Stem Cell Reports | VOL. 1
Roi Gazit, et. al.Roi Gazit ... Amy J Wagers
15 Aug 2013
Stem Cell Reports | VOL. 1

Gene regulation network inference using k-nearest neighbor-based mutual information estimation: revisiting an old DREAM
Lior I Shachaf ... Patrick Cahan
BMC Bioinformatics | VOL. 24
Lior I Shachaf, et. al.Lior I Shachaf ... Patrick Cahan
06 Mar 2023
BMC Bioinformatics | VOL. 24

Comparative analysis of module-based versus direct methods for reverse-engineering transcriptional regulatory networks
Tom Michoel ... Anagha Joshi
BMC Systems Biology | VOL. 3
Tom Michoel, et. al.Tom Michoel ... Anagha Joshi
07 May 2009
BMC Systems Biology | VOL. 3

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: PLoS Biology