Biocatalysis is becoming a data science. High-throughput experimentation generates a rapidly increasing stream of biocatalytic data, which is the raw material for mechanistic and novel data-driven modeling approaches for the predictive design of improved biocatalysts and novel bioprocesses. The holistic and molecular understanding of enzymatic reaction systems will enable us to identify and overcome kinetic bottlenecks and shift the thermodynamics of a reaction. The full characterization and modeling of reaction systems is a community effort; therefore, published methods and results should be findable, accessible, interoperable, and reusable (FAIR), which is achieved by developing standardized data exchange formats, by a complete and reproducible documentation of experimentation, by collaborative platforms for developing sustainable software and for analyzing data, and by repositories for publishing results together with raw data. The FAIRification of biocatalysis is a prerequisite to developing highly automated laboratory infrastructures that improve the reproducibility of scientific results and reduce the time and costs required to develop novel synthesis routes.
Read full abstract