Abstract
The efficiency of machine learning (ML) models is crucial to minimize inference times and reduce the carbon footprint of models deployed in production environments. Current models employed in retrosynthesis, which generate a synthesis route from a target molecule to purchasable compounds, are prohibitively slow. These models operate in a single-step fashion within a tree search algorithm, predicting reactant molecules given a product molecule as input. In this study, we investigate the ability of alternative transformer architectures, knowledge distillation (KD), and simple hyper-parameter optimization to decrease the inference time of the Chemformer model. We first assess closely related transformer architectures and conclude that these models under-perform when trained with KD. We then investigate the effects of feature-based and response-based KD, together with hyper-parameters optimized for inference time per sample and model accuracy. Although reducing model size and improving single-step speed are important, our results indicate that multi-step search efficiency is influenced more strongly by the diversity and confidence of the single-step model. Based on this work, further research should combine KD with other techniques, as multi-step speed continues to hinder the proper integration of synthesis planning. In Monte Carlo-based (MC) multi-step retrosynthesis in particular, other factors play a crucial role in balancing exploration and exploitation during the search, often outweighing the direct impact of single-step model speed and carbon footprint.
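As a purely illustrative aside on the two KD variants named above, the sketch below shows one common way response-based and feature-based distillation losses are formulated in PyTorch. All names and weights (teacher, student, alpha, beta, T, proj) are assumptions for illustration and are not taken from the paper's implementation.

```python
# Minimal sketch of response-based and feature-based KD losses,
# assuming a PyTorch teacher/student pair. Hyper-parameters here are
# placeholders; the paper tunes such values via its own optimization.
import torch
import torch.nn.functional as F

def response_based_kd_loss(student_logits, teacher_logits, T=2.0):
    """KL divergence between temperature-softened output distributions."""
    log_p_student = F.log_softmax(student_logits / T, dim=-1)
    p_teacher = F.softmax(teacher_logits / T, dim=-1)
    # T**2 rescales gradients so they stay comparable to the hard-label loss.
    return F.kl_div(log_p_student, p_teacher, reduction="batchmean") * T**2

def feature_based_kd_loss(student_hidden, teacher_hidden, proj):
    """MSE between intermediate representations; `proj` (a hypothetical
    linear layer) maps the smaller student hidden size onto the teacher's."""
    return F.mse_loss(proj(student_hidden), teacher_hidden)

def total_loss(hard_loss, resp_loss, feat_loss, alpha=0.5, beta=0.1):
    # Weighted combination of the standard supervised loss and the
    # two distillation terms; alpha and beta are illustrative weights.
    return (1 - alpha) * hard_loss + alpha * resp_loss + beta * feat_loss
```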