Abstract

In this work we show the effectiveness of 2D structural fingerprints in predicting the aquatic toxicity of chemical compounds, creating a self-contained system for structure-based aquatic toxicity classification. Using data from the U.S. Environmental Protection Agency Fathead Minnow (EPA-FHM) dataset [1], we build a nonlinear RBF SVM [2] classifier that distinguishes acutely toxic compounds from less toxic compounds, loosely following the criterion stipulated by the E.U. REACH legislation [3]. The classifier achieves up to 86% accuracy in leave-one-out validation using 580 of the dataset’s 614 compounds. This performance is comparable to that of models built from the same dataset using more sophisticated molecular descriptors, such as AutoMEP and Sterimol descriptors [4]. We apply our classification model to predict the aquatic toxicity of 3 million compounds in the MMsINC database [5]. Furthermore, we create a linear SVM model using the same technique and apply it to the MMsINC data, additionally integrating the EXPLAIN system [6], which allows us to show which structural features are responsible for the model classifying a molecule as less toxic or acutely toxic.
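The leave-one-out protocol mentioned above can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation: the six 6-bit fingerprints and their labels are invented for illustration (they are not EPA-FHM data), the `gamma` value is an arbitrary assumption, and a simple RBF-kernel vote stands in for the SVM so that the example runs with the Python standard library alone.

```python
import math


def rbf_kernel(a, b, gamma=0.5):
    """RBF kernel exp(-gamma * ||a - b||^2).

    For 0/1 fingerprint vectors the squared distance is simply the
    number of bits on which the two fingerprints differ.
    """
    sq_dist = sum((x - y) ** 2 for x, y in zip(a, b))
    return math.exp(-gamma * sq_dist)


def kernel_vote(train_X, train_y, x, gamma=0.5):
    """Stand-in classifier (NOT an SVM): each training compound votes for
    its own class, weighted by its RBF similarity to the query x."""
    score = sum(
        (1 if label == 1 else -1) * rbf_kernel(fp, x, gamma)
        for fp, label in zip(train_X, train_y)
    )
    return 1 if score > 0 else 0


def leave_one_out_accuracy(X, y, gamma=0.5):
    """Hold out each compound once, fit on the rest, and score the
    prediction for the held-out compound."""
    correct = 0
    for i in range(len(X)):
        train_X = X[:i] + X[i + 1:]
        train_y = y[:i] + y[i + 1:]
        if kernel_vote(train_X, train_y, X[i], gamma) == y[i]:
            correct += 1
    return correct / len(X)


# Hypothetical toy data: 1 = acutely toxic, 0 = less toxic.
X = [
    [1, 1, 0, 0, 0, 0],
    [1, 1, 1, 0, 0, 0],
    [1, 0, 1, 0, 0, 0],
    [0, 0, 0, 1, 1, 0],
    [0, 0, 0, 1, 1, 1],
    [0, 0, 0, 0, 1, 1],
]
y = [1, 1, 1, 0, 0, 0]

accuracy = leave_one_out_accuracy(X, y)
print(f"Leave-one-out accuracy: {accuracy:.2f}")
```

In practice the kernel vote would be replaced by a trained SVM from a library (for example scikit-learn's `SVC` with `kernel="rbf"`), and the toy vectors by real 2D fingerprints; the hold-one-out loop itself would stay the same.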
