Abstract

In this paper, we present a novel approach to predicting chemical structures from their infrared (IR) spectra using deep Q-learning. IR spectra measurements are widely used in chemical analysis because they provide information on the types and characteristics of chemical bonds present within compounds. However, there are currently no algorithms to predict the entire chemical structure of a broad range of compounds relying solely on IR spectra, unless there is an exact or closely matched spectrum in an existing reference spectra library. To address this, we apply double deep Q-learning for automated prediction of the entire chemical structures of organic compounds based on IR spectra. Our method builds predicted structures by starting from a single carbon atom and subsequently adding an atom and bond step-by-step by ranking the rewards of each possible addition based on Q-values. We devised new structural similarity metrics, atom bond count and substructure count metrics to achieve our goal. Compared to the commonly used structural similarity score, the Jaccard index of extended-connectivity fingerprints, the devised metrics exhibit more suitable properties for Q-learning. The deep Q-model, which uses the combination of our two proposed metrics, gives the overall best performance and can generate structures similar to the actual structures in terms of their structural features and molecular weight in most tested cases.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.