Hold out the genome: a roadmap to solving the cis-regulatory code.

Carl G De Boer,Jussi Taipale

doi:10.1038/s41586-023-06661-w

Abstract

Gene expression is regulated by transcription factors that work together to read cis-regulatory DNA sequences. The 'cis-regulatory code' - how cells interpret DNA sequences to determine when, where and how much genes should be expressed - has proven to be exceedingly complex. Recently, advances in the scale and resolution of functional genomics assays and machine learning have enabled substantial progress towards deciphering this code. However, the cis-regulatory code will probably never be solved if models are trained only on genomic sequences; regions of homology can easily lead to overestimation of predictive performance, and our genome is too short and has insufficient sequence diversity to learn all relevant parameters. Fortunately, randomly synthesized DNA sequences enable testing a far larger sequence space than exists in our genomes, and designed DNA sequences enable targeted queries to maximally improve the models. As the same biochemical principles are used to interpret DNA regardless of its source, models trained on these synthetic data can predict genomic activity, often better than genome-trained models. Here we provide an outlook on the field, and propose a roadmap towards solving the cis-regulatory code by a combination of machine learning and massively parallel assays using synthetic DNA.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Nature	Publication Date: Dec 13, 2023
Citations: 23	License type: cc-by

R Discovery Prime

R Discovery Prime

Hold out the genome: a roadmap to solving the cis-regulatory code.

Abstract

Talk to us

Similar Papers

More From: Nature

Lead the way for us

Similar Papers

Expression-Guided In Silico Evaluation of Candidate Cis Regulatory Codes for Drosophila Muscle Founder Cells
Anthony A Philippakis ... Stephen S Gisselbrecht
PLoS Computational Biology | VOL. 2
Anthony A Philippakis, et. al.Anthony A Philippakis ... Stephen S Gisselbrecht
01 May 2006
PLoS Computational Biology | VOL. 2

Deciphering the multi-scale, quantitative cis-regulatory code
Seungsoo Kim ... Joanna Wysocka
Molecular cell | VOL. 83
Seungsoo Kim, et. al.Seungsoo Kim ... Joanna Wysocka
23 Jan 2023
Molecular cell | VOL. 83

Seven myths of how transcription factors read the cis-regulatory code
Julia Zeitlinger
Current Opinion in Systems Biology | VOL. 23
Julia ZeitlingerJulia Zeitlinger
04 Sep 2020
Current Opinion in Systems Biology | VOL. 23

Pharmacologic profiling of transcriptional targets deciphers promoter logic
W J Freebern ... D D Taub
The Pharmacogenomics Journal | VOL. 5
W J Freebern, et. al.W J Freebern ... D D Taub
26 Jul 2005
The Pharmacogenomics Journal | VOL. 5

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Hold out the genome: a roadmap to solving the cis-regulatory code.

Abstract

Talk to us

Similar Papers

More From: Nature