Abstract

The important role of non coding RNAs (ncRNAs) in the cell has made their identification a critical issue in the biological research. However, traditional approaches such as PT-PCR and Northern Blot are costly. With recent progress in bioinformatics and computational prediction technology, the discovery of ncRNAs has become realistically possible. This paper aims to introduce major computational approaches in the identification of ncRNAs, including homologous search, de novo prediction and mining in deep sequencing data. Furthermore, related software tools have been compared and reviewed along with a discussion on future improvements.

Highlights

  • In the early biological researches, scientists mainly focused on the prokaryotes, which are dominated (85%90%) by protein-coding genes [1] and it is publicly considered that the cellular activities are implemented by the proteins which are transcribed from those coding genes

  • It is estimated that 98% of mammalian genomic output may be non-coding RNAs, while the remaining 2% encodes the proteins [2]

  • Non-coding RNAs are involved in translation, splicing, gene regulation, chromatin remodeling, gene modification, degradation and other functions [61]

Read more

Summary

INTRODUCTION

In the early biological researches, scientists mainly focused on the prokaryotes, which are dominated (85%90%) by protein-coding genes [1] and it is publicly considered that the cellular activities are implemented by the proteins which are transcribed from those coding genes. Non-coding RNAs are involved in translation, splicing, gene regulation, chromatin remodeling, gene modification, degradation and other functions [61] They are closely associated with cancer and other complex diseases [4]. Non-coding RNAs are recognized only in biological experiments with technologies such as full-length complementary DNA cloning and genomic tiling arrays in the transcriptomes of organisms These technologies can suit long ncRNA (lncRNA) genes in an efficient way, they are costly always requiring enough RNA samples, and are limited. Different functions are induced by diverse ncRNA structures and there are variations in ncRNA length They are different from protein-coding genes which have a lot of common conserved features of primary sequence, including splice cites, promoters, terminator and binding motifs etc. We try to introduce most currently existing approaches about the identification of ncRNAs

METHODS
Methods
Findings
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.