Abstract

MiRNAs have been widely studied due to their important post-transcriptional regulatory roles in gene expression. Many reports have demonstrated the evidence of miRNA isoform products (isomiRs) in high-throughput small RNA sequencing data. However, the biological function involved in these molecules is still not well investigated. Here, we developed a Shannon entropy-based model to estimate isomiR expression profiles of high-throughput small RNA sequencing data extracted from miRBase webserver. By using the Kolmogorov-Smirnov statistical test (KS test), we demonstrated that the 5p and 3p miRNAs present more variants than the single arm miRNAs. We also found that the isomiR variant, except the 3’ isomiR variant, is strongly correlated with Minimum Free Energy (MFE) of pre-miRNA, suggesting the intrinsic feature of pre-miRNA should be one of the important factors for the miRNA regulation. The functional enrichment analysis showed that the miRNAs with high variation, particularly the 5’ end variation, are enriched in a set of critical functions, supporting these molecules should not be randomly produced. Our results provide a probabilistic framework for miRNA isoforms analysis, and give functional insights into pre-miRNA processing.

Highlights

  • MiRNAs are ~22 nt endogenous small non-coding RNAs, mediating the translation repression or trigger degradation by paring with target mRNAs in post-translational regulation to control gene expression [1,2]

  • We calculated MIH values using the small RNA sequences extracted from miRBase website

  • By comparing MIH and length with Minimum Free Energy (MFE) extracted from miRBase, we found the MFE and length of pre-miRNAs should have a great contribution to the isomiR variants (Fig. 3A and B)

Read more

Summary

Introduction

MiRNAs are ~22 nt endogenous small non-coding RNAs, mediating the translation repression or trigger degradation by paring with target mRNAs in post-translational regulation to control gene expression [1,2]. Advances in next-generation sequencing (NGS) technology are giving rise to a fast accumulation of known miRNAs. In the lasted miRBase version, the human genome encodes for over 1,500 miRNAs [3]. A mature miRNA commences from the genome as a primary miRNA transcript (pri-miRNA) via RNA polymerase II-mediated transcription. Together with DGCR8, the nuclear RNase III-type protein Drosha cleaves the pri-miRNA to release the precursor miRNA (pre-miRNA), a hairpin-like secondary structure. With the exportin 5-dependent pathway, the PLOS ONE | DOI:10.1371/journal.pone.0118856. With the exportin 5-dependent pathway, the PLOS ONE | DOI:10.1371/journal.pone.0118856 March 18, 2015

Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call