Abstract

The neural network correction approach that was previously proposed to achieve the chemical accuracy for first-principles methods is further developed by a combination of the Kennard-Stone sampling and Bootstrapping methods. As a result, the accuracy of the calculated heat of formation is improved further, and moreover, the error bar of each calculated result can be determined. An enlarged database (Chen/13), which contains a total of 539 molecules made of the common elements H, C, N, O, F, S, and Cl, is constructed and is divided into the training (449 molecules) and testing (90 molecules) data sets with the Kennard-Stone sampling method. Upon the neural network correction, the mean absolute deviation (MAD) of the B3LYP/6-311+G(3df,2p) calculated heat of formation is reduced from 10.92 to 1.47 kcal mol(-1) and 14.95 to 1.31 kcal mol(-1) for the training and testing data sets, respectively. Furthermore, the Bootstrapping method, a broadly used statistical method, is employed to assess the accuracy of each neural-network prediction by determining its error bar. The average error bar for the testing data set is 1.05 kcal mol(-1), therefore achieving the chemical accuracy. When a testing molecule falls into the regions of the "Chemical Space" where the distribution density of the training molecules is high, its predicted error bar is comparatively small, and thus, the predicted value is accurate as it should be. As a challenge, the resulting neural-network is employed to discern the discrepancy among the existing experimental data.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.