Artificial applicability labels for improving policies in retrosynthesis prediction

Esben Jannik Bjerrum,Ola Engkvist,Amol Thakkar

doi:10.1088/2632-2153/abcf90

Esben Jannik Bjerrum, Ola Engkvist + Show 1 more

Open Access

https://doi.org/10.1088/2632-2153/abcf90

Copy DOI

Abstract

Automated retrosynthetic planning algorithms are a research area of increasing importance. Automated reaction-template extraction from large datasets, in conjunction with neural-network-enhanced tree-search algorithms, can find plausible routes to target compounds in seconds. However, the current method for training neural networks to predict suitable templates for a given target product leads to many predictions that are not applicable in silico. Most templates in the top 50 suggested templates cannot be applied to the target molecule to perform the virtual reaction. Here, we describe how to generate data and train a neural network policy that predicts whether templates are applicable or not. First, we generate a massive training dataset by applying each retrosynthetic template to each product from our reaction database. Second, we train a neural network to perform near-perfect prediction of the applicability labels on a held-out test set. The trained network is then joined with a policy model trained to predict and prioritize templates using the labels from the original dataset. The combined model was found to outperform the policy model in a route-finding task using 1700 compounds from our internal drug-discovery projects.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Machine Learning: Science and Technology	Publication Date: Dec 24, 2020
Citations: 3	License type: cc-by

R Discovery Prime

R Discovery Prime

Artificial applicability labels for improving policies in retrosynthesis prediction

Abstract

Talk to us

Similar Papers

More From: Machine Learning: Science and Technology

Lead the way for us

Similar Papers

Neural Network with Combined Approximation of the Surface of the Response
Oleksandra S Mishchuk ... Pavlo B Vitynskyi
Research Bulletin of the National Technical University of Ukraine "Kyiv Politechnic Institute" | VOL. 0
Oleksandra S Mishchuk, et. al.Oleksandra S Mishchuk ... Pavlo B Vitynskyi
12 Jun 2018
Research Bulletin of the National Technical University of Ukraine "Kyiv Politechnic Institute" | VOL. 0

Multiprocessor scheduling and neural network training methods using shuffled frog-leaping algorithm
Binodini Tripathy ... Sasmita Kumari Padhy
Computers & Industrial Engineering | VOL. 80
Binodini Tripathy, et. al.Binodini Tripathy ... Sasmita Kumari Padhy
16 Dec 2014
Computers & Industrial Engineering | VOL. 80

Optimization of Energy Consumption in Chemical Production Based on Descriptive Analytics and Neural Network Modeling
Alexey I Shinkevich ... Yulia V Vertakova
Mathematics | VOL. 9
Alexey I Shinkevich, et. al.Alexey I Shinkevich ... Yulia V Vertakova
06 Feb 2021
Mathematics | VOL. 9

A novel method for training neural networks for time-series prediction in environmental systems
M.J Aitkenhead ... S Palmer
Ecological Modelling | VOL. 162
M.J Aitkenhead, et. al.M.J Aitkenhead ... S Palmer
28 Jan 2003
Ecological Modelling | VOL. 162

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Artificial applicability labels for improving policies in retrosynthesis prediction

Abstract

Talk to us

Similar Papers

More From: Machine Learning: Science and Technology