Abstract

BackgroundG protein-coupled receptors (GPCRs) mediate a variety of important physiological functions, are closely related to many diseases, and constitute the most important target family of modern drugs. Therefore, the research of GPCR analysis and GPCR ligand screening is the hotspot of new drug development. Accurately identifying the GPCR-drug interaction is one of the key steps for designing GPCR-targeted drugs. However, it is prohibitively expensive to experimentally ascertain the interaction of GPCR-drug pairs on a large scale. Therefore, it is of great significance to predict the interaction of GPCR-drug pairs directly from the molecular sequences. With the accumulation of known GPCR-drug interaction data, it is feasible to develop sequence-based machine learning models for query GPCR-drug pairs.ResultsIn this paper, a new sequence-based method is proposed to identify GPCR-drug interactions. For GPCRs, we use a novel bag-of-words (BoW) model to extract sequence features, which can extract more pattern information from low-order to high-order and limit the feature space dimension. For drug molecules, we use discrete Fourier transform (DFT) to extract higher-order pattern information from the original molecular fingerprints. The feature vectors of two kinds of molecules are concatenated and input into a simple prediction engine distance-weighted K-nearest-neighbor (DWKNN). This basic method is easy to be enhanced through ensemble learning. Through testing on recently constructed GPCR-drug interaction datasets, it is found that the proposed methods are better than the existing sequence-based machine learning methods in generalization ability, even an unconventional method in which the prediction performance was further improved by post-processing procedure (PPP).ConclusionsThe proposed methods are effective for GPCR-drug interaction prediction, and may also be potential methods for other target-drug interaction prediction, or protein-protein interaction prediction. In addition, the new proposed feature extraction method for GPCR sequences is the modified version of the traditional BoW model and may be useful to solve problems of protein classification or attribute prediction. The source code of the proposed methods is freely available for academic research at https://github.com/wp3751/GPCR-Drug-Interaction.

Highlights

  • G protein-coupled receptors (GPCRs) mediate a variety of important physiological functions, are closely related to many diseases, and constitute the most important target family of modern drugs

  • Because the importance and particularity of GPCRs, we focus the study in the computational approach for identifying GPCR-drug interaction only based on sequence information

  • Firstly, we fix the hyperparameter in the prediction engine distance-weighted Knearest-neighbor (DWKNN), and compare the prediction performance with different representations of GPCRs and drugs

Read more

Summary

Introduction

G protein-coupled receptors (GPCRs) mediate a variety of important physiological functions, are closely related to many diseases, and constitute the most important target family of modern drugs. As the largest family of human membrane protein, GPCRs mediate multiple physiological processes such as neurotransmission, cellular metabolism, secretion, cellular differentiation, growth, inflammatory, and immune responses [1, 2] As a result, these receptors have emerged as the most important drug targets in human pathophysiology [3, 4]. With the breakthroughs in GPCR crystallography [9, 10], the structure-based methods are potentially impactful for GPCR-targeted drug design [11,12,13] As far as it goes, sequence-based methods may be an easy and efficient choice owing to machine learning technology and the accumulation of target-drug interaction data stored in KEGG [14], SuperTarget [15], DrugBank [16] and so on

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call