Abstract

The aim of the following research is to assess the applicability of calculated quantum properties of molecular fragments as molecular descriptors in machine learning classification task. The research is based on bio-concentration and QM9-extended databases. A number of compounds with results from quantum-chemical calculations conducted with Psi4 quantum chemistry package was also added to the quantum properties database. Classification results are compared with a baseline of random guesses and predictions obtained with the traditional RDKit generated molecular descriptors. Chosen classification metrics show that results obtained with fragments quantum descriptors fall between results from baseline and those provided by molecular descriptors widely applied in cheminformatics. According to the results, the implementation of principal component analysis, causes a drop in categorization metrics.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call