Abstract

With rapid development of high-throughput technique in the field of life science, LncRNAs are found to be inextricably linked to many diseases that seriously endanger human health. However, traditional experiment-based methods for inferring unknown diseases-LncRNA associations are time-consuming and laborious, therefore, it has become an effective way to adopt computational models to predict potential LncRNA-disease associations in recent years. In this article, a novel prediction model called LHNHLDA has been proposed. In LHNHLDA, based on known LncRNA-disease associations downloaded from benchmark databases, a heterogeneous LncRNA-disease network is built first by integrating LncRNA-LncRNA similarities with disease-disease similarities. And then, through adopting the LHN-2 algorithm, the path-based similarities between different nodes in the newly constructed heterogeneous network are obtained, which can be utilized to infer potential associations between LncRNAs and diseases. Finally, in order to evaluate the performance of LHNHLDA, intensive experiments have been done, and experimental results show that LHNHLDA can achieve reliable AUCs of 0.8155, 0.8281 and 0.8569 under the frameworks of 2-Fold CV, 5-Fold CV and LOO-CV respectively. Furthermore, case studies on lung cancer and leukemia illustrate that there are 10 and 7 potential LncRNAs out of the top 10 related LncRNAs of leukemia and lung cancer predicted by LHNHLDA having been confirmed to be linked to these two kinds of diseases by latest studies separately. Hence, due to the satisfactory prediction performance achieved by LHNHLDA, it is obvious that LHNHLDA may be a useful tool for future researches in the field of bioinformatics.

Highlights

  • Human genetic information has long been thought to be stored only in protein-coding genes, which leads RNAs to having long been overlooked by researchers as an intermediate for translating DNAs into proteins [1,2]

  • NcRNAs have been proven to be related to many biological processes in human life [5, 6], and can be classified into long ncRNAs (LncRNAs) and small ncRNAs according to the length of nucleotide during transcription

  • LHNHLDA is compared with KATZLDA, PMFILDA [48], LRLSLDA based on the dataset downloaded from 2017-version of LncRNAdisease database

Read more

Summary

Introduction

Human genetic information has long been thought to be stored only in protein-coding genes, which leads RNAs to having long been overlooked by researchers as an intermediate for translating DNAs into proteins [1,2]. RNAs can be classified as coding RNAs and non-coding RNAs (ncRNAs), where coding RNA is partially transcribed by genes that can encode proteins, which only accounts for less than 2% of the total RNAs, whilst the rest of more than 98% RNAs are ncRNAs that do not have the function of encoding proteins [3, 4]. NcRNAs have been proven to be related to many biological processes in human life [5, 6], and can be classified into long ncRNAs (LncRNAs) and small ncRNAs according to the length of nucleotide during transcription.

Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.