Bioinformatic Challenges of Big Data in Non-Coding RNA Research

Christina H Liu,Da-Yu Wu,Jonathan D Pollock

doi:10.3389/fgene.2012.00178

Abstract

Bioinformatic Challenges of Big Data in Non-Coding RNA Research

Highlights

Prior to the high-throughput sequencing techniques, computational programs were developed to search for new miRNAs based on attainable sequence data. These methods used one of the following approaches (Mendes et al, 2009): filterbased approaches, which identified small high-quality sets of conserved miRNA candidates; machine learning methods, which determined initial set of candidates with stem-loops structures, and target-centered approaches, which identify short conserved motifs in the 3′UTRs of protein-coding genes (Xie et al, 2005). Even though these algorithms were developed before the highthroughput sequencing era, they establish strong bases for bioinformatic analyses of big sequencing data; new nonprotein-coding RNA (ncRNA) and targets continue to be cataloged into many databases with sufficient annotations available to the public
High-throughput sequencing techniques and deep sequencing have offered much improved avenue for ncRNA discovery (Lu et al, 2005), by searching genomic sequences for evidence of hairpin structures and determine if sequencing read aligned to these structures mimic miRNA processing byproducts (Friedlander et al, 2008), or using a regularized
Because of the high sensitivity of the technique, the “raw” data will contain sequencing primers and contaminants which can potentially produce sequence bias that requires more sophisticated computational approaches to sieve out miRNA transcripts (Mendes et al, 2009) and cross-platform validations

Summary

Introduction

Prior to the high-throughput sequencing techniques, computational programs were developed to search for new miRNAs based on attainable sequence data. These methods used one of the following approaches (Mendes et al, 2009): filterbased approaches, which identified small high-quality sets of conserved miRNA candidates; machine learning methods, which determined initial set of candidates with stem-loops structures, and target-centered approaches, which identify short conserved motifs in the 3′UTRs of protein-coding genes (Xie et al, 2005).

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Frontiers in Genetics	Publication Date: Jan 1, 2012
Citations: 8	License type: cc-by

R Discovery Prime

R Discovery Prime

Bioinformatic Challenges of Big Data in Non-Coding RNA Research

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Genetics

Lead the way for us

Similar Papers

Commercial high-throughput sequencing and its applications in DNA analysis
Hai Peng ... Jing Zhang
Biologia | VOL. 64
Hai Peng, et. al.Hai Peng ... Jing Zhang
30 Jan 2009
Biologia | VOL. 64

Immune repertoire high-throughput sequence analysis (IRAS) web service (65.20)
Chunlin Wang ... Qunying Yang
The Journal of Immunology | VOL. 186
Chunlin Wang, et. al.Chunlin Wang ... Qunying Yang
01 Apr 2011
Immune repertoire high-throughput sequence analysis (IRAS) web service (65.20)
Chunlin Wang ... Qunying Yang

Technological and computational advances driving high-throughput oncology.
Leonie Kolmar ... Alexis Autour
Trends in Cell Biology | VOL. 32
Leonie Kolmar, et. al.Leonie Kolmar ... Alexis Autour
01 Nov 2022
Trends in Cell Biology | VOL. 32

Editorial: Bioinformatics of Non-Coding RNAs with Applications to Biomedicine: Recent Advances and Open Challenges.
Alessandro Laganà ... Alfredo Ferro
Frontiers in bioengineering and biotechnology | VOL. 3
Alessandro Laganà, et. al.Alessandro Laganà ... Alfredo Ferro
08 Oct 2015
Frontiers in bioengineering and biotechnology | VOL. 3

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Bioinformatic Challenges of Big Data in Non-Coding RNA Research

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Frontiers in Genetics