Abstract

Quantitative structure – activity relationships (QSARs) for the Lowest Observed Adverse Effect Level (LOAEL) for a large set of organic compounds (n = 341) are suggested. The molecular structures of these compounds are represented by Simplified Molecular Input-Line Entry Systems (SMILES). A criteria for the estimation quality of split into the “visible” training set (used for developing a model) and “invisible” external validation set is suggested. The correlation between the above criterion and the predictive potential of developed QSAR model (root-mean-square error for “invisible” validation set) has been detected. One-variable models are built up for several different splits into the “visible” training set and “invisible” validation set. The statistical quality of these models is quite good. Mechanistic interpretation and the domain of applicability for these models are defined according to probabilistic point of view. The methodology for defining applicability domain in QSAR modeling with SMILES notation based optimal descriptors is presented.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.