CRMnet: A deep learning model for predicting gene expression from large regulatory sequence datasets.

Ke Ding,Jiayu Wen,Brian J. Parker,Gunjan Dixit

doi:10.3389/fdata.2023.1113402

Ke Ding, Jiayu Wen + Show 2 more

Open Access

https://doi.org/10.3389/fdata.2023.1113402

Copy DOI

Journal: Frontiers in big data	Publication Date: Mar 14, 2023
Citations: 1	License type: CC BY 4.0

Affiliation: Australian National University

Abstract

Recent large datasets measuring the gene expression of millions of possible gene promoter sequences provide a resource to design and train optimized deep neural network architectures to predict expression from sequences. High predictive performance due to the modeling of dependencies within and between regulatory sequences is an enabler for biological discoveries in gene regulation through model interpretation techniques. To understand the regulatory code that delineates gene expression, we have designed a novel deep-learning model (CRMnet) to predict gene expression in Saccharomyces cerevisiae. Our model outperforms the current benchmark models and achieves a Pearson correlation coefficient of 0.971 and a mean squared error of 3.200. Interpretation of informative genomic regions determined from model saliency maps, and overlapping the saliency maps with known yeast motifs, supports that our model can successfully locate the binding sites of transcription factors that actively modulate gene expression. We compare our model's training times on a large compute cluster with GPUs and Google TPUs to indicate practical training times on similar datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

CRMnet: A deep learning model for predicting gene expression from large regulatory sequence datasets.

Abstract

Talk to us

Similar Papers

More From: Frontiers in big data

Lead the way for us

Similar Papers

Correlating Gene Expression Variation with cis-Regulatory Polymorphism in Saccharomyces cerevisiae
Kevin Chen ... Erik Van Nimwegen
Genome Biology and Evolution | VOL. 2
Kevin Chen, et. al.Kevin Chen ... Erik Van Nimwegen
01 Jan 2009
Genome Biology and Evolution | VOL. 2

Transcription factor binding site clusters identify target genes with similar tissue-wide expression and buffer against mutations
Peter K Rogan ... Peter Rogan
F1000Research | VOL. 7
Peter K Rogan, et. al.Peter K Rogan ... Peter Rogan
25 Mar 2019
F1000Research | VOL. 7

Transcription factor binding site clusters identify target genes with similar tissue-wide expression and buffer against mutations.
Ruipeng Lu ... Peter K Rogan
F1000Research | VOL. 7
Ruipeng Lu, et. al.Ruipeng Lu ... Peter K Rogan
08 Apr 2019
F1000Research | VOL. 7

Optogenetic Repressors of Gene Expression in Yeasts Using Light-Controlled Nuclear Localization.
Stephanie H Geller ... Barbara Di Ventura
Cellular and Molecular Bioengineering | VOL. 12
Stephanie H Geller, et. al.Stephanie H Geller ... Barbara Di Ventura
24 Sep 2019
Cellular and Molecular Bioengineering | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

CRMnet: A deep learning model for predicting gene expression from large regulatory sequence datasets.

Abstract

Talk to us

Similar Papers

More From: Frontiers in big data