Exploring the Use of Compound-Induced Transcriptomic Data Generated From Cell Lines to Predict Compound Activity Toward Molecular Targets.

Benoît Baillif,Joerg Wichard,David Rouquié,Oscar Méndez-Lucio

doi:10.3389/fchem.2020.00296

Benoît Baillif, Joerg Wichard + Show 2 more

Open Access

https://doi.org/10.3389/fchem.2020.00296

Copy DOI

Journal: Frontiers in Chemistry	Publication Date: Apr 23, 2020
Citations: 19	License type: CC BY 4.0

Affiliation: Bayer (France), Bayer (Germany)

Abstract

Pharmaceutical or phytopharmaceutical molecules rely on the interaction with one or more specific molecular targets to induce their anticipated biological responses. Nonetheless, these compounds are also prone to interact with many other non-intended biological targets, also known as off-targets. Unfortunately, off-target identification is difficult and expensive. Consequently, QSAR models predicting the activity on a target have gained importance in drug discovery or in the de-risking of chemicals. However, a restricted number of targets are well characterized and hold enough data to build such in silico models. A good alternative to individual target evaluations is to use integrative evaluations such as transcriptomics obtained from compound-induced gene expression measurements derived from cell cultures. The advantage of these particular experiments is to capture the consequences of the interaction of compounds on many possible molecular targets and biological pathways, without having any constraints concerning the chemical space. In this work, we assessed the value of a large public dataset of compound-induced transcriptomic data, to predict compound activity on a selection of 69 molecular targets. We compared such descriptors with other QSAR descriptors, namely the Morgan fingerprints (similar to extended-connectivity fingerprints). Depending on the target, active compounds could show similar signatures in one or multiple cell lines, whether these active compounds shared similar or different chemical structures. Random forest models using gene expression signatures were able to perform similarly or better than counterpart models built with Morgan fingerprints for 25% of the target prediction tasks. These performances occurred mostly using signatures produced in cell lines showing similar signatures for active compounds toward the considered target. We show that compound-induced transcriptomic data could represent a great opportunity for target prediction, allowing to overcome the chemical space limitation of QSAR models.

Highlights

Signatures (GESs) of cell line responses to so-called perturbagens (Subramanian et al, 2017)
Gene Expression Signatures (GESs) is represented by an instance, that is a combination of a perturbagen, cell line, concentration and time point, and is composed by the plate-normalized expression z-scores of the whole genome, inferred from 978 landmark genes
We investigated the link between compound structure information (n = 9,035) and their corresponding induced biological responses captured by GESs (n = 39,544) in human tumor cell lines and evaluated the potential of machine learning approaches to infer about molecular targets involved in the compound bioactivity

Summary

INTRODUCTION

Signatures (GESs) of cell line responses to so-called perturbagens (Subramanian et al, 2017). A commonly used technique is to compute descriptors from chemical structures, like the extended-connectivity fingerprints (ECFPs) and use them for prediction, relying on the quantitative structure-activity relationship (QSAR) principle, i.e., molecules sharing a similar structure may share a similar activity profile (Rogers and Hahn, 2010; Cherkasov et al, 2014) Such molecule descriptors show limitations: they do not perform well for all target prediction tasks depending on the quantity and quality of available activity data, prediction is limited to the applicability domain (depending on the training set used), and a small change in chemical structure can lead to a large change in biological response (activity cliffs). A large public CMAP L1000 dataset was released representing more than 300,000 Gene Expression

MATERIALS AND METHODS

DISCUSSION

Findings

DATA AVAILABILITY STATEMENT

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Exploring the Use of Compound-Induced Transcriptomic Data Generated From Cell Lines to Predict Compound Activity Toward Molecular Targets.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Chemistry

Lead the way for us

Similar Papers

Abstract A268: TAS-119, a selective Aurora A inhibitor, enhanced the antitumor efficacy of taxanes in multiple human tumor cell lines including paclitaxel-resistant cells.
Akihiro Miura ... Hiroshi Hirai
Molecular Cancer Therapeutics | VOL. 12
Akihiro Miura, et. al.Akihiro Miura ... Hiroshi Hirai
01 Nov 2013
Molecular Cancer Therapeutics | VOL. 12

Classification of High‐Activity Tiagabine Analogs by Binary QSAR Modeling
Andreas Jurik ... Regina Reicherstorfer
Molecular Informatics | VOL. 32
Andreas Jurik, et. al.Andreas Jurik ... Regina Reicherstorfer
15 May 2013
Molecular Informatics | VOL. 32

Abstract C097: Pyrrolo[2′,3′:3,4]cyclohepta[1,2-d][1,2]oxazoles: A new class of antimitotic agents
Virginia Spanò ... Alessandra Montalbano
Molecular Cancer Therapeutics | VOL. 18
Virginia Spanò, et. al.Virginia Spanò ... Alessandra Montalbano
01 Dec 2019
Abstract C097: Pyrrolo[2′,3′:3,4]cyclohepta[1,2-d][1,2]oxazoles: A new class of antimitotic agents
Virginia Spanò ... Alessandra Montalbano

Identifying targetable markers of resistance to dual TORC1/2 inhibition in endometrial cancer cell lines.
Katie Mcgreal ... Jason David
Journal of Clinical Oncology | VOL. 40
Katie Mcgreal, et. al.Katie Mcgreal ... Jason David
01 Jun 2022
Journal of Clinical Oncology | VOL. 40

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Exploring the Use of Compound-Induced Transcriptomic Data Generated From Cell Lines to Predict Compound Activity Toward Molecular Targets.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Chemistry