Abstract

Computer-aided promoter design is a major development trend in synthetic promoter engineering. Various deep learning models have been used to evaluate or screen synthetic promoters, but there have been few works on de novo promoter design. To explore the potential ability of generative models in promoter design, we established a diffusion-based generative model for promoter design in Escherichia coli. The model was completely driven by sequence data and could study the essential characteristics of natural promoters, thus generating synthetic promoters similar to natural promoters in structure and component. We also improved the calculation method of FID indicator, using a convolution layer to extract the feature matrix of the promoter sequence instead. As a result, we got an FID equal to 1.37, which meant synthetic promoters have a distribution similar to that of natural ones. Our work provides a fresh approach to de novo promoter design, indicating that a completely data-driven generative model is feasible for promoter design.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call