Modelling of ready biodegradability based on combined public and industrial data sources

F Lunghini,G Marcou,P Gantzer,P Azam,D Horvath,E Van Miert,A Varnek

doi:10.1080/1062936x.2019.1697360

Abstract

ABSTRACTThe European Registration, Evaluation, Authorization and Restriction of Chemical Substances Regulation, requires marketed chemicals to be evaluated for Ready Biodegradability (RB), considering in silico prediction as valid alternative to experimental testing. However, currently available models may not be relevant to predict compounds of industrial interest, due to accuracy and applicability domain restriction issues. In this work, we present a new and extended RB dataset (2830 compounds), issued by the merging of several public data sources. It was used to train classification models, which were externally validated and benchmarked against already-existing tools on a set of 316 compounds coming from the industrial context. New models showed good performances in terms of predictive power (Balance Accuracy (BA) = 0.74–0.79) and data coverage (83–91%). The Generative Topographic Mapping approach identified several chemotypes and structural motifs unique to the industrial dataset, highlighting for which chemical classes currently available models may have less reliable predictions. Finally, public and industrial data were merged into global dataset containing 3146 compounds. This is the biggest dataset reported in the literature so far, covering some chemotypes absent in the public data. Thus, predictive model developed on the Global dataset has larger applicability domain than the existing ones.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Modelling of ready biodegradability based on combined public and industrial data sources

Abstract

Talk to us

Similar Papers

More From: SAR and QSAR in Environmental Research

Lead the way for us

Journal: SAR and QSAR in Environmental Research	Publication Date: Dec 20, 2019
Citations: 15

Similar Papers

An analysis and classification of public information security data sources used in research and practice
Clemens Sauerwein ... Ruth Breu
Computers & Security | VOL. 82
Clemens Sauerwein, et. al.Clemens Sauerwein ... Ruth Breu
25 Dec 2018
Computers & Security | VOL. 82

Surveillance of methadone-related adverse drug events using multiple public health data sources
Shannon A Sims ... Christina A Porucznik
Journal of Biomedical Informatics | VOL. 40
Shannon A Sims, et. al.Shannon A Sims ... Christina A Porucznik
01 Nov 2006
Journal of Biomedical Informatics | VOL. 40

The role of interoperable data standards in precision livestock farming in extensive livestock systems: A review
Christiane Bahlo ... Mark Trotter
Computers and Electronics in Agriculture | VOL. 156
Christiane Bahlo, et. al.Christiane Bahlo ... Mark Trotter
11 Dec 2018
Computers and Electronics in Agriculture | VOL. 156

Sensitivity of modeled residential fine particulate matter exposure to select building and source characteristics: A case study using public data in Boston, MA
Chad W Milando ... M Patricia Fabian
Science of the Total Environment | VOL. 840
Chad W Milando, et. al.Chad W Milando ... M Patricia Fabian
09 Jun 2022
Science of the Total Environment | VOL. 840

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Modelling of ready biodegradability based on combined public and industrial data sources

Abstract

Talk to us

Similar Papers

More From: SAR and QSAR in Environmental Research