TranScreen: Transfer Learning on Graph-Based Anti-Cancer Virtual Screening Model

Milad Salem,Julia Webb,Aminollah Khormali,Arash Keshavarzi Arshadi,Jiann-Shiun Yuan

doi:10.3390/bdcc4030016

Milad Salem, Julia Webb + Show 3 more

Open Access

https://doi.org/10.3390/bdcc4030016

Copy DOI

Journal: Big Data and Cognitive Computing	Publication Date: Jun 29, 2020
Citations: 13	License type: CC BY 4.0

Affiliation: University of Central Florida

Abstract

Deep learning’s automatic feature extraction has proven its superior performance over traditional fingerprint-based features in the implementation of virtual screening models. However, these models face multiple challenges in the field of early drug discovery, such as over-training and generalization to unseen data, due to the inherently unbalanced and small datasets. In this work, the TranScreen pipeline is proposed, which utilizes transfer learning and a collection of weight initializations to overcome these challenges. An amount of 182 graph convolutional neural networks are trained on molecular source datasets and the learned knowledge is transferred to the target task for fine-tuning. The target task of p53-based bioactivity prediction, an important factor for anti-cancer discovery, is chosen to showcase the capability of the pipeline. Having trained a collection of source models, three different approaches are implemented to compare and rank them for a given task before fine-tuning. The results show improvement in performance of the model in multiple cases, with the best model increasing the area under receiver operating curve ROC-AUC from 0.75 to 0.91 and the recall from 0.25 to 1. This improvement is vital for practical virtual screening via lowering the false negatives and demonstrates the potential of transfer learning. The code and pre-trained models are made accessible online.

Highlights

Drug development is a long and costly process during which a drug candidate is discovered and widely tested to be both efficient and safe
Graph convolutional neural networks have improved the accuracy of virtual screening models, yet face the challenge of imbalanced, non-diverse, and small training datasets
Transfer learning is utilized from 182 source models trained on the MoleculeNet database

Summary

Introduction

Drug development is a long and costly process during which a drug candidate is discovered and widely tested to be both efficient and safe. Molecular descriptors and fingerprints are used to extract features from the input molecules, which are passed to a machine learning model for training. This pipeline has been used for many virtual screening tasks such as kinase inhibition prediction [3], side-effect prediction [4], cytotoxicity prediction [5], and anti-cancer agent prediction [6].

Objectives

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

TranScreen: Transfer Learning on Graph-Based Anti-Cancer Virtual Screening Model

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Big Data and Cognitive Computing

Lead the way for us

Similar Papers

Multiscale patch-based feature graphs for image classification
Matheus V Todescato ... Joel L Carbonera
Expert Systems with Applications | VOL. 235
Matheus V Todescato, et. al.Matheus V Todescato ... Joel L Carbonera
08 Aug 2023
Expert Systems with Applications | VOL. 235

Deep convolutional neural networks with ensemble learning and transfer learning for capacity estimation of lithium-ion batteries
Sheng Shen ... Chao Hu
Applied Energy | VOL. 260
Sheng Shen, et. al.Sheng Shen ... Chao Hu
16 Dec 2019
Applied Energy | VOL. 260

Comparative analysis of deep convolution neural network models on small scale datasets
C.R Edwin Selva Rex ... J Jenifer Jose
Optik | VOL. 271
C.R Edwin Selva Rex, et. al.C.R Edwin Selva Rex ... J Jenifer Jose
13 Nov 2022
Optik | VOL. 271

Addressing data scarcity in protein fitness landscape analysis: A study on semi-supervised and deep transfer learning techniques
José A Barbero-Aparicio ... José F Díez-Pastor
Information Fusion | VOL. 102
José A Barbero-Aparicio, et. al.José A Barbero-Aparicio ... José F Díez-Pastor
22 Sep 2023
Information Fusion | VOL. 102

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

TranScreen: Transfer Learning on Graph-Based Anti-Cancer Virtual Screening Model

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Big Data and Cognitive Computing