Abstract

Most machine learning (ML) methods produce predictions that are hard or impossible to understand. The black-box nature of predictive models obscures potential learning bias and makes it difficult to recognize and trace problems. Moreover, the inability to rationalize model decisions causes reluctance to accept predictions for experimental design. For ML, limited trust in predictions presents a substantial problem and continues to limit its impact in interdisciplinary research, including early-phase drug discovery. As a desirable remedy, approaches from explainable artificial intelligence (XAI) are increasingly applied to shed light on the ML black box and help to rationalize predictions. Among these is the concept of counterfactuals (CFs), which are best understood as test cases with small modifications yielding opposing prediction outcomes (such as different class labels in object classification). For ML applications in medicinal chemistry, such as compound activity prediction, CFs are particularly intuitive because these hypothetical molecules enable immediate comparisons with actual test compounds that do not require expert ML knowledge and are accessible to practicing chemists. Such comparisons often reveal structural moieties in compounds that determine their predictions and can be further investigated. Herein, we adapt and extend a recently introduced concept for the systematic generation of molecular CFs to multi-task predictions of different classes of protein kinase inhibitors, analyze CFs in detail, rationalize the origins of CF formation in multi-task modeling, and present exemplary explanations of predictions.
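
To make the idea of a molecular CF concrete, the sketch below is an illustrative assumption rather than the procedure used in this work: given a binary activity classifier, it screens a set of hypothetical candidate analogs, keeps only those whose predicted class flips relative to the query compound, and returns the most similar one (Tanimoto similarity over Morgan fingerprints via RDKit). The classifier, the query compound, and the candidate set are placeholders.

```python
# Hypothetical sketch of molecular counterfactual selection for a binary
# activity classifier. Model, compounds, and candidates are placeholders,
# not the data or method of this study.
from rdkit import Chem
from rdkit.Chem import AllChem, DataStructs


def fingerprint(smiles):
    """Morgan fingerprint (radius 2, 2048 bits) for a SMILES string."""
    mol = Chem.MolFromSmiles(smiles)
    return AllChem.GetMorganFingerprintAsBitVect(mol, 2, nBits=2048)


def predict_active(fp):
    """Placeholder for a trained classifier returning 1 (active) or 0 (inactive)."""
    # In practice this would be a trained model (e.g., random forest or neural network).
    return int(fp.GetNumOnBits() % 2 == 0)  # dummy rule for illustration only


def find_counterfactual(query_smiles, candidate_smiles):
    """Return the candidate most similar to the query whose predicted class flips."""
    query_fp = fingerprint(query_smiles)
    query_label = predict_active(query_fp)
    best, best_sim = None, -1.0
    for smi in candidate_smiles:
        fp = fingerprint(smi)
        if predict_active(fp) == query_label:
            continue  # same prediction -> not a counterfactual
        sim = DataStructs.TanimotoSimilarity(query_fp, fp)
        if sim > best_sim:
            best, best_sim = smi, sim
    return best, best_sim


if __name__ == "__main__":
    query = "c1ccccc1CCN"  # hypothetical test compound
    candidates = ["c1ccccc1CCO", "c1ccccc1CC(=O)N", "c1ccc(F)cc1CCN"]
    cf, sim = find_counterfactual(query, candidates)
    print(f"Counterfactual: {cf} (Tanimoto similarity {sim:.2f})")
```

In this reading, a CF is simply the minimally modified analog that the model treats differently, which is what makes the comparison with the actual test compound directly interpretable for chemists.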
