Abstract

MicroRNAs (miRNAs) are 19 to 25 nucleotides non-coding RNAs known to possess important post-transcriptional regulatory functions. Identifying targeting genes that miRNAs regulate are important for understanding their specific biological functions. Usually, miRNAs down-regulate target genes through binding to the complementary sites in the 3' untranslated region (UTR) of the targets. In part, due to the large number of miRNAs and potential targets, an experimental based prediction design would be extremely laborious and economically unfavorable. However, since the bindings of the animal miRNAs are not a perfect one-to-one match with the complementary sites of their targets, it is difficult to predict targets of animal miRNAs by accessing their alignment to the 3' UTRs of potential targets. Consequently, sophisticated computational approaches for miRNA target prediction are being considered as essential methods in miRNA research.We surveyed most of the current computational miRNA target prediction algorithms in this paper. Particularly, we provided a mathematical definition and formulated the problem of target prediction under the framework of statistical classification. Moreover, we summarized the features of miRNA-target pairs in target prediction approaches and discussed these approaches according to two categories, which are the rule-based and the data-driven approaches. The rule-based approach derives the classifier mainly on biological prior knowledge and important observations from biological experiments, whereas the data driven approach builds statistic models using the training data and makes predictions based on the models. Finally, we tested a few different algorithms on a set of experimentally validated true miRNA-target pairs [1] and a set of false miRNA-target pairs, derived from miRNA overexpression experiment [2]. Receiver Operating Characteristic (ROC) curves were drawn to show the performances of these algorithms.

Highlights

  • In classical molecular biology, the functional units in a genome are genes or the DNA regions that code proteins

  • MicroRNAs are a class of single-stranded non-coding RNAs with about 19 to 25 nucleotides in length, which are mostly known to inhibit the translation of mRNAs into proteins or promote repression of mRNA expression [3, 4]

  • DIANA-microT retrieves orthologous human and mouse 3' untranslated region (UTR) from mRNA Reference Sequences (RefSeq) database and 94 miRNAs conserved in human and mouse

Read more

Summary

INTRODUCTION

The functional units in a genome are genes or the DNA regions that code proteins. Given the importance of the topic, we provided a timely survey of the computational algorithms for miRNA target prediction in this paper. New data-driven prediction algorithms emerge along with the improving knowledge of miRNA target recognition and the increasing availability of various types of relevant data sets. We are aware that there exists good surveys on miRNA target prediction including articles [39,40,41,42,43,44], each addressing the survey from a different perspective Their coverage and depth are adequate for the intended audience, they lack the discussions of issues closer to the computation community.

PRINCIPLES OF miRNA TARGET RECOGNITION AND PREDICTION ONLINE SOURCE
Definition and Problem Formulation
Important Features in miRNA Target Prediction
Seed Region Match
Conservation
Free Energy
In-Site Features
Accessibility Energy
Rule Based Algorithm
TargetScan and TargetScanS
RNAhybrid
MicroInspector
MovingTargets
PicTar
SVMicro
TargetBoost
Algorithm Using Expression Level Data
PERFORMANCE COMPARISON OF DIFFERENT ALGORITHMS
Findings
CONCLUSION
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.