Abstract

Drugs are an important way to treat various diseases. However, they inevitably produce side effects, bringing great risks to human bodies and pharmaceutical companies. How to predict the side effects of drugs has become one of the essential problems in drug research. Designing efficient computational methods is an alternative way. Some studies paired the drug and side effect as a sample, thereby modeling the problem as a binary classification problem. However, the selection of negative samples is a key problem in this case. In this study, a novel negative sample selection strategy was designed for accessing high-quality negative samples. Such strategy applied the random walk with restart (RWR) algorithm on a chemical-chemical interaction network to select pairs of drugs and side effects, such that drugs were less likely to have corresponding side effects, as negative samples. Through several tests with a fixed feature extraction scheme and different machine-learning algorithms, models with selected negative samples produced high performance. The best model even yielded nearly perfect performance. These models had much higher performance than those without such strategy or with another selection strategy. Furthermore, it is not necessary to consider the balance of positive and negative samples under such a strategy.

Highlights

  • Drugs are always special products for the treatment of various diseases

  • A drug network was constructed according to the chemical-chemical interaction (CCI) information retrieved from STITCH [18, 19]

  • The negative sample selection strategy applied the random walk with restart (RWR) algorithm to the drug network and extracted negative samples according to the threshold ε of the probability

Read more

Summary

Introduction

Drugs are always special products for the treatment of various diseases. a drug is a double-edged sword; it can bring some unexpected negative effects, usually called side effects, when it produces therapeutic effects. It is a good idea to build a uniform frame to predict side effects of given drugs These models are always complex and have high computational complexity. Some studies proposed a uniform binary classification model for predicting drug side effects [14,15,16,17]. They deemed the pairs of drugs and side effects as samples. A refined negative sample selection strategy was proposed to select high-quality negative samples To this end, a drug network was constructed according to the chemical-chemical interaction (CCI) information retrieved from STITCH [18, 19]. The proportion of positive and negative samples was not a problem when our negative sample selection strategy was used

Materials and Methods
Results and Discussion
Conclusions
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call