Abstract
BackgroundmicroRNAs are a class of small RNAs, about 20 nt long, which regulate cellular processes in animals and plants. Identifying microRNAs is one of the most important tasks in gene regulation studies. The main features used for identifying these tiny molecules are those in hairpin secondary structures of pre-microRNA.ResultsA new classifier is employed to identify precursor microRNAs from both pseudo hairpins and other non-coding RNAs. This classifier achieves a geometric mean Gm = 92.20% with just three features and 92.91% with seven features.ConclusionThis study shows that linear dimensionality reduction combined with explicit feature mapping, namely miLDR-EM, achieves high performance in classification of microRNAs from other sequences. Also, explicitly mapping data onto a high dimensional space could be a useful alternative to kernel-based methods for large datasets with a small number of features. Moreover, we demonstrate that microRNAs can be accurately identified by just using three properties that involve minimum free energy.
Highlights
MicroRNAs are a class of small RNAs, about 20 nt long, which regulate cellular processes in animals and plants
Negative dataset Pseudo hairpins: The negative dataset is composed of 8,494 human pseudo hairpin sequences which were previously used in Triplet-support vector machine (SVM), MiPred, miPred and microPred
The mapping is applied to the dataset with pairs of features, using the polynomial and radial basis function (RBF) mapping functions
Summary
MicroRNAs are a class of small RNAs, about 20 nt long, which regulate cellular processes in animals and plants. Identifying microRNAs is one of the most important tasks in gene regulation studies. MicroRNAs are a class of small non-coding RNAs that play a crucial role in gene regulation by perfectly or imperfectly binding into three prime untranslated regions (3’ UTR) in messenger RNAs, and cause repression of translating mRNAs into proteins or their cleavage. MicroRNAs were only identified by using experimental methods. Traditional experimental approaches to microRNA discovery include cloning and sequencing [3], and can detect novel microRNAs. Since microRNAs usually express at low levels and depend on tissue and conditions of the cell, these methods may be unable to identify new microRNAs [1]. In particular, 454 sequencing, have become popular for discovering new microRNAs [4]
Published Version (
Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have