Abstract

The identification of drug-target interactions (DTIs) plays a crucial role in drug discovery. However, the traditional high-throughput techniques based on clinical trials are costly, cumbersome, and time-consuming for identifying DTIs. Hence, new intelligent computational methods are urgently needed to surmount these defects in predicting DTIs. In this paper, we propose a novel computational method that combines position-specific scoring matrix (PSSM), elastic net based sparse features extraction, and rotation forest (RF) classifier. Specifically, we converted each protein primary sequence into PSSM, which contains biological evolutionary information. Then we extract the hidden sparse feature descriptors in PSSM by elastic net based sparse feature extraction method (ESFE). After that, we fuse them with the features of drug, which are represented by molecular fingerprints. Finally, rotation forest classifier works on detecting the potential drug-target interactions. When performing the proposed method by the experiments of fivefold cross validation (CV) on enzyme, ion channel, G protein-coupled receptors (GPCRs), and nuclear receptor datasets, this method achieves average accuracies of 90.32%, 88.91%, 80.65%, and 79.73%, respectively. We also compared the proposed model with the state-of-the-art support vector machine (SVM) classifier and other effective methods on the same datasets. The comparison results distinctly indicate that the proposed model possesses the efficient and robust ability to predict DTIs. We expect that the new model will be able to take effects on predicting massive DTIs.

Highlights

  • Identification of drug-target interactions (DTIs) plays an increasingly critical part in drug development

  • We proposed a novel computational model combining position-specific scoring matrix (PSSM), elastic net based sparse feature extraction, and rotation forest classifier to identify drug-target interactions. e fivefold cross validation (CV) method comprehensively assessed the prediction ability of the proposed model on the datasets

  • Our model achieves average accuracies of 90.32%, 88.91%, 80.65%, and 79.73% on such datasets as enzyme, ion channel, G protein-coupled receptors (GPCRs), and nuclear receptor

Read more

Summary

Introduction

Identification of DTIs plays an increasingly critical part in drug development. Drug-target interactions guarantee the health promotion by preventing and treating diseases. The databases including DrugBank [3], PubChem [4], erapeutical Target Database (TTD) [5], and ZINC [6] have provided the data of small molecule drugs and biotechnology drugs They provide biological and chemical information such as molecular structures, drug-target interactions, and characteristics of relevant drug [7]. Ding et al [19] proposed a double Laplacian regularized least-squares (DLapRLS) method based on Hilbert-Schmidt independence criterion and multikernel learning (HSIC-MKL) model It builds kernels for multiple information sources and uses alternating least squares to train it. We proposed a novel computational model that combines PSSM, elastic net based sparse feature extraction (ESFE) method, and RF classifier to identify drugtarget identifications.

Materials and Methods
Results and Discussion
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call