An ensemble model of QSAR tools for regulatory risk assessment.

Prachi Pradeep,Stephen J Merrill,Richard J Povinelli,Shannon White

doi:10.1186/s13321-016-0164-0

Abstract

Quantitative structure activity relationships (QSARs) are theoretical models that relate a quantitative measure of chemical structure to a physical property or a biological effect. QSAR predictions can be used for chemical risk assessment for protection of human and environmental health, which makes them interesting to regulators, especially in the absence of experimental data. For compatibility with regulatory use, QSAR models should be transparent, reproducible and optimized to minimize the number of false negatives. In silico QSAR tools are gaining wide acceptance as a faster alternative to otherwise time-consuming clinical and animal testing methods. However, different QSAR tools often make conflicting predictions for a given chemical and may also vary in their predictive performance across different chemical datasets. In a regulatory context, conflicting predictions raise interpretation, validation and adequacy concerns. To address these concerns, ensemble learning techniques in the machine learning paradigm can be used to integrate predictions from multiple tools. By leveraging various underlying QSAR algorithms and training datasets, the resulting consensus prediction should yield better overall predictive ability. We present a novel ensemble QSAR model using Bayesian classification. The model allows for varying a cut-off parameter that allows for a selection in the desirable trade-off between model sensitivity and specificity. The predictive performance of the ensemble model is compared with four in silico tools (Toxtree, Lazar, OECD Toolbox, and Danish QSAR) to predict carcinogenicity for a dataset of air toxins (332 chemicals) and a subset of the gold carcinogenic potency database (480 chemicals). Leave-one-out cross validation results show that the ensemble model achieves the best trade-off between sensitivity and specificity (accuracy: 83.8 % and 80.4 %, and balanced accuracy: 80.6 % and 80.8 %) and highest inter-rater agreement [kappa (κ): 0.63 and 0.62] for both the datasets. The ROC curves demonstrate the utility of the cut-off feature in the predictive ability of the ensemble model. This feature provides an additional control to the regulators in grading a chemical based on the severity of the toxic endpoint under study.Electronic supplementary materialThe online version of this article (doi:10.1186/s13321-016-0164-0) contains supplementary material, which is available to authorized users.

Highlights

Chemical risk assessment associated with chemical exposure is necessary for the protection of human and environmental health
Sensitivity, specificity, balanced accuracy, Positive predictive value (PPV) and negative predictive value (NPV) Statistical performance of the ensemble model in comparison to the various Quantitative structure activity relationships (QSARs) tools is summarized in Tables 2 and 3
The accuracy (>80 %), balanced accuracy (>78 %), PPV (>79 %) and NPV (>79 %) of the Bayes ensemble model is highly improved compared to the base classifiers (QSAR tools) for both the datasets

Summary

Introduction

Chemical risk assessment associated with chemical exposure is necessary for the protection of human and environmental health. Toxicity or adverse effects are major reasons for failure of a potential pharmaceutical, an industrial chemical or a medical device [1,2,3]. Regulatory risk assessment is the process that ensures marketing of safe and effective drugs, medical devices and other consumer products. Several commercial (free or proprietary) and open source in silico QSAR tools are available that can predict the toxic effects of a chemical based on its chemical structure [6, 7]. QSAR models are widely used for identification of chemicals that have a desired biological effect (e.g. drug leads) or for early prediction of potential toxic effects in the pharmaceutical industry. QSAR models can be used to: (1) supplement experimental data, (2) support prioritization in the absence of experimental data, and (3) replace experimental animal testing methods [8, 9]

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Cheminformatics	Publication Date: Sep 22, 2016
Citations: 50	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

An ensemble model of QSAR tools for regulatory risk assessment.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Cheminformatics

Lead the way for us

Similar Papers

Ensemble hologram quantitative structure activity relationship model of the chromatographic retention index of aldehydes and ketones
Bin Lei ... Long Jiao
Se pu = Chinese journal of chromatography | VOL. 39
Bin Lei, et. al.Bin Lei ... Long Jiao
01 Mar 2021
Se pu = Chinese journal of chromatography | VOL. 39

Ligand-based pharmacophore modelling and virtual screening for the identification of amyloid-beta diagnostic molecules
Ephraim Felix Marondedze ... Penny Poomani Govender
Journal of Molecular Graphics and Modelling | VOL. 101
Ephraim Felix Marondedze, et. al.Ephraim Felix Marondedze ... Penny Poomani Govender
21 Aug 2020
Journal of Molecular Graphics and Modelling | VOL. 101

Prediction of Log P of Halogenated Alkanes by Their ELUMO and Number of Chlorine and Carbon
... Walter Z Tang
Environmental Processes | VOL. 3
, et. al. ... Walter Z Tang
29 Jan 2016
Environmental Processes | VOL. 3

Multi-Strategy Assessment of Different Uses of QSAR under REACH Analysis of Alternatives to Advance Information Transparency.
Kazue Chinen ... Timothy Malloy
International Journal of Environmental Research and Public Health | VOL. 19
Kazue Chinen, et. al.Kazue Chinen ... Timothy Malloy
04 Apr 2022
International Journal of Environmental Research and Public Health | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An ensemble model of QSAR tools for regulatory risk assessment.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Cheminformatics