Extending in Silico Protein Target Prediction Models to Include Functional Effects.

Lewis H Mervin,Lars Brive,Avid M Afzal,Andreas Bender,Ola Engkvist

doi:10.3389/fphar.2018.00613

Lewis H Mervin, Lars Brive + Show 3 more

Open Access

https://doi.org/10.3389/fphar.2018.00613

Copy DOI

Abstract

In silico protein target deconvolution is frequently used for mechanism-of-action investigations; however existing protocols usually do not predict compound functional effects, such as activation or inhibition, upon binding to their protein counterparts. This study is hence concerned with including functional effects in target prediction. To this end, we assimilated a bioactivity training set for 332 targets, comprising 817,239 active data points with unknown functional effect (binding data) and 20,761,260 inactive compounds, along with 226,045 activating and 1,032,439 inhibiting data points from functional screens. Chemical space analysis of the data first showed some separation between compound sets (binding and inhibiting compounds were more similar to each other than both binding and activating or activating and inhibiting compounds), providing a rationale for implementing functional prediction models. We employed three different architectures to predict functional response, ranging from simplistic random forest models (‘Arch1’) to cascaded models which use separate binding and functional effect classification steps (‘Arch2’ and ‘Arch3’), differing in the way training sets were generated. Fivefold stratified cross-validation outlined cascading predictions provides superior precision and recall based on an internal test set. We next prospectively validated the architectures using a temporal set of 153,467 of in-house data points (after a 4-month interim from initial data extraction). Results outlined Arch3 performed with the highest target class averaged precision and recall scores of 71% and 53%, which we attribute to the use of inactive background sets. Distance-based applicability domain (AD) analysis outlined that Arch3 provides superior extrapolation into novel areas of chemical space, and thus based on the results presented here, propose as the most suitable architecture for the functional effect prediction of small molecules. We finally conclude including functional effects could provide vital insight in future studies, to annotate cases of unanticipated functional changeover, as outlined by our CHRM1 case study.

Highlights

Target deconvolution is an important step in the subsequent analysis of data gleaned from phenotypic screenings, to identify the modulated targets of active compounds and enable the continued dissection of the biological processes involved in a system of interest (Terstappen et al, 2007; Raida, 2011; Kotz, 2012; Lee and Bogyo, 2013)
Imposing only two functional labels may be an over-simplification, this is preferred to the complex situation resulting from the original complex BioAssay Ontology (BAO) labeling, since it reduces training data into a binary problem per protein target, ensures larger numbers of compounds are retrained within each MOA, and that generated predictions are compared between the complete spectra of functional predictions between targets
Changes in certain moieties are shown to affect binding outcome more than others; for example, one study highlighted that steric modifications near a basic nitrogen, methylation of indoles, and aniline nitrogen substitutions appeared to play important roles in determining functional activity while keeping overall structure rather similar (Dosa and Amin, 2016)

Summary

INTRODUCTION

Target deconvolution is an important step in the subsequent analysis of data gleaned from phenotypic screenings, to identify the modulated targets of active compounds and enable the continued dissection of the biological processes involved in a system of interest (Terstappen et al, 2007; Raida, 2011; Kotz, 2012; Lee and Bogyo, 2013). The characterization of the functional effects of compounds is often a principle shortcoming for current in silico methods, since many protocols only provide probability for compound affinity at a target (Drakakis et al, 2013; Koutsoukas et al, 2013; Mervin et al, 2015) Existing protocols, such as the Similarity Ensemble Approach (SEA) (Keiser et al, 2007) and Prediction of Activity Spectra for Substances (PASS) (Lagunin et al, 2000), provide functional annotation by training on a compound set extracted from the MDL Drug Data Report [MDDR] (2006). The cross validation and time-split performance of the approaches has provided guidance into the choice of architecture to be deployed in-house for future triage processes

MATERIALS AND METHODS

RESULTS

Cross Validation Results

DISCUSSION

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Frontiers in Pharmacology	Publication Date: Jun 11, 2018
Citations: 5	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Extending in Silico Protein Target Prediction Models to Include Functional Effects.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Pharmacology

Lead the way for us

Similar Papers

Deep Learning Improves Speed and Accuracy of Prostate Gland Segmentations on Magnetic Resonance Imaging for Targeted Biopsy.
Simon John Christoph Soerensen ... Indrani Bhattacharya
Journal of Urology | VOL. 206
Simon John Christoph Soerensen, et. al.Simon John Christoph Soerensen ... Indrani Bhattacharya
21 Apr 2021
Journal of Urology | VOL. 206

Uncertainty quantification: Can we trust artificial intelligence in drug discovery?
Jie Yu ... Mingyue Zheng
iScience | VOL. 25
Jie Yu, et. al.Jie Yu ... Mingyue Zheng
21 Jul 2022
iScience | VOL. 25

Computed Tomography-Based Radiomics Model to Predict Central Cervical Lymph Node Metastases in Papillary Thyroid Carcinoma: A Multicenter Study.
Jingjing Li ... Xicheng Song
Frontiers in Endocrinology | VOL. 12
Jingjing Li, et. al.Jingjing Li ... Xicheng Song
21 Oct 2021
Frontiers in Endocrinology | VOL. 12

Mammography-based radiomics for predicting the risk of breast cancer recurrence: a multicenter study.
Ning Mao ... Ping Yin
The British Journal of Radiology | VOL. 94
Ning Mao, et. al.Ning Mao ... Ping Yin
14 Sep 2021
The British Journal of Radiology | VOL. 94

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Extending in Silico Protein Target Prediction Models to Include Functional Effects.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Pharmacology