Optimizing Chemical Reactions with Deep Reinforcement Learning.

Zhenpeng Zhou, Richard N. Zare, Xiaocheng Li

doi:10.1021/acscentsci.7b00492

Open Access

https://doi.org/10.1021/acscentsci.7b00492

Journal: ACS Central Science	Publication Date: Dec 15, 2017
Citations: 329	License type: publisher-specific-oa

Affiliation: Stanford University

Abstract

Deep reinforcement learning was employed to optimize chemical reactions. Our model iteratively records the results of a chemical reaction and chooses new experimental conditions to improve the reaction outcome. This model outperformed a state-of-the-art black-box optimization algorithm, using 71% fewer steps on both simulations and real reactions. Furthermore, we introduced an efficient exploration strategy that draws the reaction conditions from certain probability distributions, which reduced the regret from 0.062 to 0.039 compared with a deterministic policy. Combining the efficient exploration policy with accelerated microdroplet reactions, optimal reaction conditions were determined in 30 min for the four reactions considered, and a better understanding of the factors that control microdroplet reactions was reached. Moreover, our model performed better after training on reactions with similar or even dissimilar underlying mechanisms, which demonstrates its learning ability.
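
To make the ideas of stochastic exploration and regret concrete, the following is a minimal Python sketch and not the authors' deep reinforcement learning model. It swaps in a simple stochastic hill climb: each step draws new conditions from a Gaussian around the best conditions seen so far and records the outcome. The response surface toy_yield, the two condition names, the spreads, and the definition of average regret as the mean gap between the best achievable outcome and each observed outcome are all assumptions made for illustration.

```python
import random


def toy_yield(temperature, flow_rate):
    """Hypothetical response surface; its maximum of 1.0 is at (60, 5)."""
    return 1.0 / (
        1.0
        + ((temperature - 60.0) / 20.0) ** 2
        + ((flow_rate - 5.0) / 2.0) ** 2
    )


def sample_conditions(center, spread, rng):
    """Draw the next conditions from a Gaussian around the current center."""
    return tuple(rng.gauss(c, s) for c, s in zip(center, spread))


def average_regret(outcomes, best_outcome):
    """Assumed definition: mean gap between the best achievable outcome
    and the outcome observed at each step."""
    return sum(best_outcome - y for y in outcomes) / len(outcomes)


def run(steps=30, seed=0):
    """Stochastic hill climb (a stand-in for the learned policy)."""
    rng = random.Random(seed)
    center = (30.0, 2.0)           # hypothetical starting conditions
    best_seen = toy_yield(*center)
    outcomes = [best_seen]
    for _ in range(steps):
        candidate = sample_conditions(center, (5.0, 0.5), rng)
        y = toy_yield(*candidate)  # "run the reaction" and record the result
        outcomes.append(y)
        if y > best_seen:          # keep the best conditions found so far
            center, best_seen = candidate, y
    return outcomes, average_regret(outcomes, best_outcome=1.0)


if __name__ == "__main__":
    outcomes, regret = run()
    print(f"best yield: {max(outcomes):.3f}, average regret: {regret:.3f}")
```

A lower average regret means the search spent fewer steps at poor conditions, which is the sense in which the abstract compares the stochastic and deterministic policies.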

Full Text