Abstract

Identifying disease-related microRNAs (miRNAs) is crucial to understanding the etiology and pathogenesis of many diseases. However, existing computational methods are facing a few dilemmas such as lacking “negative samples” (i.e. confirmed unrelated miRNA-disease pairs). In this study, we proposed LRMCMDA, a low-rank matrix completion-based method to predict miRNA-disease associations. LRMCMDA firstly constructs a bipartite miRNA-disease graph from known associations and defines its R-projected miRNA graph, in which two miRNAs are connected if they are adjacent to the same disease in the bipartite graph. Similarly, we can define its D-projected disease graph. It then infers negative samples by assuming that connecting an unrelated miRNA-disease pair in the bipartite graph will change its R-projected miRNA graph and D-projected disease graph. Providing with both known miRNA-disease associations and negative samples, LRMCMDA infers associations between all miRNAs and diseases using a low-rank matrix completion model, in which miRNA similarity and disease similarity are incorporated into regularization terms. The assumption is that similar miRNAs will associate with similar diseases and vice versa. We compared LRMCMDA with a few state-of-the-art algorithms on several established miRNA-disease databases. LRMCMDA achieves an AUC of 0.8882 on the 5-fold cross-validation, significantly outperforming canonical methods when predicting miRNA-disease associations, and associating miRNAs with isolated diseases. The experimental results demonstrate that LRMCMDA effectively infers novel miRNA-disease associations. In addition, the case studies on cancers have further proven that LRMCMDA is useful in identifying potential cancer-associated miRNAs for experimental validation.

Highlights

  • As a class of RNAs critical in post-transcriptional gene regulations, microRNAs are short non-coding RNAs of lengths approximately 22 nucleotides [1]–[3]

  • MATERIALS AND METHODS We presented an overview of low-rank matrix completionbased miRNA-disease association (LRMCMDA) in Fig. 1, which mainly consists of 4 steps: LRMCMDA first calculates (1) semantic similarity between diseases and (2) functional similarity between miRNAs; (3) It constructs a miRNA-disease bipartite graph, through which to infer negative samples, i.e. unrelated miRNA-disease pairs; (4) LRMCMDA applies a low-rank matrix completion model integrating miRNA and disease similarities to estimate the associations between unrevealed miRNA-disease pairs and adjust the known ones

  • Considering the limited number of known and experimentally verified miRNA-disease associations, using only area under curve (AUC) to evaluate the performance of the predictive method was too arbitrary

Read more

Summary

Introduction

As a class of RNAs critical in post-transcriptional gene regulations, microRNAs (miRNAs) are short non-coding RNAs of lengths approximately 22 nucleotides [1]–[3]. They can interact with many types of biomolecules including mRNAs, The associate editor coordinating the review of this manuscript and approving it for publication was Vincenzo Conti. Long non-coding RNAs (lncRNAs), and proteins to perform various biological functions. The first discovered miRNA lin-4 plays a critical regulatory role in the nematode larvae development by regulating the expression of its target genes lin-14 and lin-28 [4].

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call