Correlated RNN Framework to Quickly Generate Molecules with Desired Properties for Energetic Materials in the Low Data Regime.

Chuan Li, Xuemei Pu, Ming Sun, Yanzhi Guo, Yuan Yuan, Yan Zeng, Guangchuan Wang, Chenghui Wang, Qiaolin Gou

doi:10.1021/acs.jcim.2c00997

Abstract

Motivated by the challenge of deep learning in the low data regime and the urgent demand for intelligent design of highly energetic materials, we explore a correlated deep learning framework, consisting of three recurrent neural networks (RNNs) correlated by a transfer learning strategy, to efficiently generate new energetic molecules with a high detonation velocity when very limited data are available. To avoid dependence on an external big data set, data augmentation by fragment shuffling of 303 energetic compounds is used to produce 500,000 molecules for pretraining an RNN, through which the model can learn sufficient structural knowledge. The pretrained RNN is then fine-tuned on the 303 energetic compounds to generate 7153 molecules similar to them. To screen more reliably for molecules with a high detonation velocity, SMILES enumeration augmentation coupled with the pretrained knowledge is used to build an RNN-based prediction model, which boosts R² from 0.4446 to 0.9572. Performance comparable to that of a transfer learning strategy based on an existing big database (ChEMBL), for producing both energetic molecules and drug-like ones, further supports the effectiveness and generality of our strategy in the low data regime. High-precision quantum mechanics calculations further confirm that 35 new molecules present a higher detonation velocity and lower synthetic accessibility than the classic explosive RDX, along with good thermal stability. In particular, three new molecules are comparable to caged CL-20 in detonation velocity. All source code and the data set are freely available at https://github.com/wangchenghuidream/RNNMGM.
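The abstract reports the prediction model's quality as R² (the coefficient of determination) but does not say how it was computed. As a minimal sketch, assuming the standard definition R² = 1 - SS_res / SS_tot, the metric can be written in plain Python. The function name `r_squared` and the detonation velocities in the example are hypothetical illustrations, not values or code from the paper.

```python
from typing import Sequence


def r_squared(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Coefficient of determination: 1 - SS_res / SS_tot (standard definition)."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("y_true and y_pred must be non-empty and of equal length")
    mean_true = sum(y_true) / len(y_true)
    # Residual sum of squares: error of the model's predictions.
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    # Total sum of squares: error of always predicting the mean.
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    if ss_tot == 0:
        raise ValueError("R^2 is undefined when all true values are identical")
    return 1.0 - ss_res / ss_tot


# Hypothetical detonation velocities in km/s (illustrative only).
measured = [8.0, 8.5, 9.0, 9.5]
predicted = [8.1, 8.4, 9.1, 9.4]
score = r_squared(measured, predicted)
print(f"R^2 = {score:.3f}")
```

Under this definition a value of 1 means the predictions match the reference values exactly, and a value of 0 means the model does no better than always predicting the mean of the reference values.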


Journal: Journal of Chemical Information and Modeling
Publication Date: Aug 23, 2022
Citations: 12

Similar Papers

"Microscopic" and "Macroscopic" Level of the Errors for Detonation Characteristics Calculations : Pedigree of the Errors
T S Pivina ... E A Arnautova
Le Journal de Physique IV | VOL. 05
01 May 1995

A novel energetic framework with the combination of 5,6-fused triazolo-triazine and nitropyrazole-tetrazole for energy-stability balanced explosive
Cheng-Chuang Li ... Hong-Wei Yang
Defence Technology | VOL. 27
28 Oct 2022

Tricyclic compounds with 1,4,2,5-dioxadiazine bridged triazoles and pyrazoles as potential energetic materials
Cong-Cong Ge ... Hong-Wei Yang
Energetic Materials Frontiers | VOL. 4
22 Dec 2022

An Efficient Data Augmentation Method for Automatic Modulation Recognition from Low-Data Imbalanced-Class Regime
Shengyun Wei ... Zhaolong Sun
Applied Sciences | VOL. 13
01 Mar 2023
