Abstract

MicroRNAs (miRNAs) play crucial roles in human disease and can be targeted by small molecule (SM) drugs according to numerous studies, which shows that identifying SM-miRNA associations in human disease is important for drug development and disease treatment. We proposed the method of Ensemble of Kernel Ridge Regression-based Small Molecule-MiRNA Association prediction (EKRRSMMA) to uncover potential SM-miRNA associations by combing feature dimensionality reduction and ensemble learning. First, we constructed different feature subsets for both SMs and miRNAs. Then, we trained homogeneous base learners based on distinct feature subsets and took the average of scores obtained from these base learners as SM-miRNA association score. In EKRRSMMA, feature dimensionality reduction technology was employed in the process of construction of feature subsets to reduce the influence of noisy data. Besides, the base learner, namely KRR_avg, was the combination of two classifiers constructed under SM space and miRNA space, which could make full use of the information of SM and miRNA. To assess the prediction performance of EKRRSMMA, we conducted Leave-One-Out Cross-Validation (LOOCV), SM-fixed local LOOCV, miRNA-fixed local LOOCV and 5-fold CV based on two datasets. For Dataset 1 (Dataset 2), EKRRSMMA got the Area Under receiver operating characteristic Curves (AUCs) of 0.9793 (0.8871), 0.8071 (0.7705), 0.9732 (0.8586) and 0.9767 ± 0.0014 (0.8560 ± 0.0027). Besides, we conducted four case studies. As a result, 32 (5-Fluorouracil), 19 (17β-Estradiol), 26 (5-Aza-2'-deoxycytidine) and 11 (cyclophosphamide) out of top 50 predicted potentially associated miRNAs were confirmed by database or experimental literature. Above evaluation results demonstrated that EKRRSMMA is reliable for predicting SM-miRNA associations.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call