IModulonMiner and PyModulon: Software for unsupervised mining of gene expression compendia.

Anand V Sastry,Yuan Yuan,Saugat Poudel,Kevin Rychel,Reo Yoo,Cameron R Lamoureux,Gaoyuan Li,Joshua T Burrows,Siddharth Chauhan,Zachary B Haiman,Tahani Al Bulushi,Yara Seif,Bernhard O Palsson,Daniel C Zielinski

doi:10.1371/journal.pcbi.1012546

Abstract

Public gene expression databases are a rapidly expanding resource of organism responses to diverse perturbations, presenting both an opportunity and a challenge for bioinformatics workflows to extract actionable knowledge of transcription regulatory network function. Here, we introduce a five-step computational pipeline, called iModulonMiner, to compile, process, curate, analyze, and characterize the totality of RNA-seq data for a given organism or cell type. This workflow is centered around the data-driven computation of co-regulated gene sets using Independent Component Analysis, called iModulons, which have been shown to have broad applications. As a demonstration, we applied this workflow to generate the iModulon structure of Bacillus subtilis using all high-quality, publicly-available RNA-seq data. Using this structure, we predicted regulatory interactions for multiple transcription factors, identified groups of co-expressed genes that are putatively regulated by undiscovered transcription factors, and predicted properties of a recently discovered single-subunit phage RNA polymerase. We also present a Python package, PyModulon, with functions to characterize, visualize, and explore computed iModulons. The pipeline, available at https://github.com/SBRG/iModulonMiner, can be readily applied to diverse organisms to gain a rapid understanding of their transcriptional regulatory network structure and condition-specific activity.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

IModulonMiner and PyModulon: Software for unsupervised mining of gene expression compendia.

Abstract

Talk to us

Similar Papers

More From: PLoS computational biology

Lead the way for us

Journal: PLoS computational biology	Publication Date: Oct 23, 2024
License type: CC BY 4.0

Similar Papers

Elucidation of independently modulated genes in Streptococcus pyogenes reveals carbon sources that control its expression of hemolytic toxins.
Yujiro Hirose ... Richard Szubin
mSystems | VOL. 8
Yujiro Hirose, et. al.Yujiro Hirose ... Richard Szubin
29 Jun 2023
mSystems | VOL. 8

Global and Threshold-Free Transcriptional Regulatory Networks Reconstruction Through Integrating ChIP-Chip and Expression Data
Qi Liu ... Zili Zhang
Current Protein & Peptide Science | VOL. 12
Qi Liu, et. al.Qi Liu ... Zili Zhang
01 Nov 2011
Current Protein & Peptide Science | VOL. 12

Revisiting regulatory coherence: accounting for temporal bias in plant gene co-expression analyses.
Haoran Cai ... David L Des Marais
New Phytologist | VOL. 238
Haoran Cai, et. al.Haoran Cai ... David L Des Marais
27 Jan 2023
New Phytologist | VOL. 238

Transcriptional regulation shapes the organization of genes on bacterial chromosomes
Sarath Chandra Janga ... Agustino Martínez-Antonio
Nucleic Acids Research | VOL. 37
Sarath Chandra Janga, et. al.Sarath Chandra Janga ... Agustino Martínez-Antonio
16 Apr 2009
Nucleic Acids Research | VOL. 37

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

IModulonMiner and PyModulon: Software for unsupervised mining of gene expression compendia.

Abstract

Talk to us

Similar Papers

More From: PLoS computational biology