Abstract

AbstractDeep learning rapidly promotes many fields with successful stories in natural language processing. An architecture of deep neural network (DNN) combining tree‐structured long short‐term memory (Tree‐LSTM) network and back‐propagation neural network (BPNN) is developed for predicting physical properties. Inspired by the natural language processing in artificial intelligence, we first developed a strategy for data preparation including encoding molecules with canonical molecular signatures and vectorizing bond‐substrings by an embedding algorithm. Then, the dynamic neural network named Tree‐LSTM is employed to depict molecular tree data‐structures while the BPNN is used to correlate properties. To evaluate the performance of proposed DNN, the critical properties of nearly 1,800 compounds are employed for training and testing the DNN models. As compared with classical group contribution methods, it can be demonstrated that the learned DNN models are able to provide more accurate prediction and cover more diverse molecular structures without considering frequencies of substructures.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call