Abstract

MicroRNAs (miRNAs) are small molecular non-coding RNAs that have important roles in the post-transcriptional mechanism of animals and plants. They are commonly 21-25 nucleo- tides (nt) long and derived from 60-90 nt RNA hairpin structures, called miRNA hairpins. A lar- ger number of sequence segments in the human genome have been computationally identified with such 60-90 nt hairpins, however the major- ity of them are not miRNA hairpins. Most exist- ing computational methods for predicting miRNA hairpins are based on a two-class classi- fier to distinguish between miRNA hairpins and other sequence segments with hairpin struc- tures. The difficulty of these methods is how to select hairpins as negative examples of miRNA hairpins in the training dataset, since only a few miRNA hairpins are available. Therefore, these classifiers may be mis-trained due to some false negative examples of the training dataset. In this paper, we introduce a one-class support vector machine (SVM) method to predict miRNA hair- pins among the hairpin structures. Different from existing methods for predicting miRNA hairpins, the one-class SVM classifier is trained only on the information of the miRNA class. We also il- lustrate some examples of predicting miRNA hairpins in human chromosomes 10, 15, and 21, where our method overcomes the above disad- vantages of existing two-class methods.

Highlights

  • MicroRNAs are small, non-coding RNAs (2125 nucleotides in length) that regulate the expression of protein-encoding genes at the post-transcriptional level [1, 2, 21]

  • We present a new method for predicting miRNA hairpins that employs support vector machines

  • The one-class support vector machine (SVM) model should capture the characteristics of known miRNA hairpins

Read more

Summary

Introduction

MicroRNAs (miRNAs) are small, non-coding RNAs (2125 nucleotides in length) that regulate the expression of protein-encoding genes at the post-transcriptional level [1, 2, 21]. The processing and maturation of miRNAs are divided into several steps before silencing their targets. The pre-miRNA is a double-stranded sequence of about 60-90 nt with a 2-nt 3' overhang and forms a hairpin structure ( called miRNA hairpin). The premiRNA is cleaved into an imperfect double-stranded RNA duplex by endonuclease RNase III enzyme called Dicer [25, 29, 42]. This duplex is composed of the mature miRNA strand and its complementary strand. Mature miRNAs are incorporated into RICS (RNAinduced silencing complex) before they bind to their targets to regulate gene expression

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call