Abstract

Long non-coding RNAs (lncRNAs) and microRNAs (miRNAs) have been involved in various biological processes. Emerging evidence suggests that the interactions between lncRNAs and miRNAs play an important role in the regulation of genes and the development of many diseases. Due to the limited scale of known lncRNA-miRNA interactions, and expensive time and labor costs for identifying them by biological experiments, more accurate and efficient lncRNA-miRNA interaction computational prediction approach urgently need to be developed. In this work, we proposed a novel computational model, GNMFLMI, to predict lncRNA-miRNA interactions using graph regularized nonnegative matrix factorization. More specifically, the similarities both lncRNA and miRNA are calculated based on known interaction information and their sequence information. Then, the affinity graphs for lncRNAs and miRNAs are constructed using the $p$ -nearest neighbors, respectively. Finally, a graph regularized nonnegative matrix factorization model is developed to accurately infer potential interactions between lncRNAs and miRNAs. To assess the performance of GNMFLMI, five-fold cross-validation experiments are carried out. The AUC values achieved by GNMFLMI on two datasets are 0.9769 and 0.8894, respectively, which outperform the compared methods. In the case studies for lncRNA nonhsat159254.1 and miRNA hsa-mir-544a, 20 and 16 of the top-20 associations predicted by GNMFLMI are confirmed, respectively. Rigorous experimental results demonstrate that GNMFLMI can effectively predict novel lncRNA-miRNA interactions, which can provide guidance for relevant biomedical research. The source code of GNMFLMI is freely available at https://github.com/haichengyi/GNMFLMI .

Highlights

  • With the development of next-generation sequencing, specific biological mechanisms can be better understood from the wide-ranging biomolecular interactions in the genome

  • The Long non-coding RNAs (lncRNAs)-miRNA interactions adjacency matrix Y ∈ Rr×n was constructed based on lncRNASNP2 database, where r is the number of lncRNAs, n is the number of miRNAs

  • MiRNA interactions, the five-fold cross validation experiments were performed on the lncRNASNP2 dataset and compare our method with the following approaches: nonnegative matrix factorization (NMF) and RNMF

Read more

Summary

Introduction

With the development of next-generation sequencing, specific biological mechanisms can be better understood from the wide-ranging biomolecular interactions in the genome. Long non-coding RNAs (lncRNAs) and microRNAs (miRNAS) were previously thought to be non-functional sequences in the process of gene evolution [1]. They play an important role in cell differentiation, somatic development and other life processes, and can participate in the occurrence of disease through interaction [1]. As a new focus of regulation for gene expression, lncRNA plays a biological role mainly through signal function, bait function, scaffold function and guiding function [9]. Even though the experiment has identified more than 58 000 human lncRNA genes, Only a few lncRNAs have been functionally characterized, such as H19, HOTAIR and Malat, Most of them are still functionally uncharacterized [10]

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call