Integrated Sequence-Structure Motifs Suffice to Identify microRNA Precursors

Xiuqin Liu, Geir Skogerbø, Runsheng Chen, Shunmin He, Fuzhou Gong, Grzegorz Kudla

doi:10.1371/journal.pone.0032797

Abstract

Background: Upwards of 1200 miRNA loci have hitherto been annotated in the human genome. The specific features defining a miRNA precursor and deciding its recognition and subsequent processing are not yet exhaustively described and miRNA loci can thus not be computationally identified with sufficient confidence.

Results: We rendered pre-miRNA and non-pre-miRNA hairpins as strings of integrated sequence-structure information, and used the software Teiresias to identify sequence-structure motifs (ss-motifs) of variable length in these data sets. Using only ss-motifs as features in a Support Vector Machine (SVM) algorithm for pre-miRNA identification achieved 99.2% specificity and 97.6% sensitivity on a human test data set, which is comparable to previously published algorithms employing combinations of sequence-structure and additional features. Further analysis of the ss-motif information contents revealed strongly significant deviations from those of the respective training sets, revealing important potential clues as to how the sequence and structural information of RNA hairpins are utilized by the miRNA processing apparatus.

Conclusion: Integrated sequence-structure motifs of variable length apparently capture nearly all information required to distinguish miRNA precursors from other stem-loop structures.
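The idea of rendering a hairpin as one string of integrated sequence-structure information, and then using motif occurrences as classifier features, can be illustrated with a minimal sketch. The encoding below is an assumption made purely for illustration (paired bases in upper case, unpaired bases in lower case, based on dot-bracket notation); the abstract does not specify the alphabet the authors used. The function names `encode_hairpin` and `motif_features` are hypothetical, and the motifs are treated as plain substrings, whereas Teiresias patterns may also contain wildcard positions.

```python
from typing import List


def encode_hairpin(sequence: str, structure: str) -> str:
    """Merge a nucleotide sequence and its dot-bracket structure into one string.

    Assumed illustrative encoding (not taken from the paper):
    paired positions '(' or ')' -> upper-case base,
    unpaired positions '.'      -> lower-case base.
    """
    if len(sequence) != len(structure):
        raise ValueError("sequence and structure must have equal length")
    encoded = []
    for base, state in zip(sequence.upper(), structure):
        if state in "()":
            encoded.append(base)
        elif state == ".":
            encoded.append(base.lower())
        else:
            raise ValueError(f"unexpected structure symbol: {state!r}")
    return "".join(encoded)


def motif_features(encoded: str, motifs: List[str]) -> List[int]:
    """Return a binary presence/absence vector, one entry per motif."""
    return [1 if motif in encoded else 0 for motif in motifs]


# Toy hairpin: a 3 bp stem closing a 4 nt loop.
toy = encode_hairpin("GGCAUUAGCC", "(((....)))")
features = motif_features(toy, ["GCa", "uua", "AAA"])
```

A vector of this kind, built over a selected set of motifs, is the sort of input an SVM could be trained on; how the authors weighted or selected motifs is not described in the abstract.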

Highlights

More than 1200 miRNAs have been identified in humans [1]
MiRNAs are processed from longer precursor transcripts, and it is the processing apparatus which decides whether an RNA hairpin structure shall
To test this hypothesis we developed a Support Vector Machine (SVM) algorithm (Mirident), which, when employing the 1300 most informative ss-motifs, was able to predict miRNA loci in the human genome with higher specificity and sensitivity than any previously published computational tool
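For reference, sensitivity and specificity as used above are the fractions of true pre-miRNAs and of true non-pre-miRNAs, respectively, that a classifier labels correctly. The sketch below computes both from label lists; the function name and the toy labels are hypothetical and are not data from the paper.

```python
from typing import Sequence, Tuple


def sensitivity_specificity(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[float, float]:
    """Compute (sensitivity, specificity) for binary labels (1 = pre-miRNA, 0 = other hairpin)."""
    if len(y_true) != len(y_pred):
        raise ValueError("label lists must have equal length")
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative example")
    sensitivity = tp / (tp + fn)  # true positive rate
    specificity = tn / (tn + fp)  # true negative rate
    return sensitivity, specificity


# Toy labels only: 4 positives (3 recovered), 4 negatives (3 rejected).
sens, spec = sensitivity_specificity([1, 1, 1, 1, 0, 0, 0, 0],
                                     [1, 1, 1, 0, 0, 0, 0, 1])
```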

Journal: PLoS ONE
Publication Date: Mar 15, 2012
Citations: 54
License type: CC BY 4.0
