Abstract

BackgroundLong non-coding RNAs (lncRNA) are a major class of non-coding RNAs. They are involved in diverse intra-cellular mechanisms like molecular scaffolding, splicing and DNA methylation. Through these mechanisms they are reported to play a role in cellular differentiation and development. They show an enriched expression in the brain where they are implicated in maintaining cellular identity, homeostasis, stress responses and plasticity. Low sequence conservation and lack of functional annotations make it difficult to identify homologs of mammalian lncRNAs in other vertebrates. A computational evaluation of the lncRNAs through systematic conservation analyses of both sequences as well as their genomic architecture is required.ResultsOur results show that a subset of mouse candidate lncRNAs could be distinguished from random sequences based on their alignment with zebrafish phastCons elements. Using ROC analyses we were able to define a measure to select significantly conserved lncRNAs. Indeed, starting from ~2,800 mouse lncRNAs we could predict that between 4 and 11% present conserved sequence fragments in fish genomes. Gene ontology (GO) enrichment analyses of protein coding genes, proximal to the region of conservation, in both organisms highlighted similar GO classes like regulation of transcription and central nervous system development. The proximal coding genes in both the species show enrichment of their expression in brain. In summary, we show that interesting genomic regions in zebrafish could be marked based on their sequence homology to a mouse lncRNA, overlap with ESTs and proximity to genes involved in nervous system development.ConclusionsConservation at the sequence level can identify a subset of putative lncRNA orthologs. The similar protein-coding neighborhood and transcriptional information about the conserved candidates provide support to the hypothesis that they share functional homology. The pipeline herein presented represents a proof of principle showing that a portion between 4 and 11% of lncRNAs retains region of conservation between mammals and fishes. We believe this study will result useful as a reference to analyze the conservation of lncRNAs in newly sequenced genomes and transcriptomes.

Highlights

  • Long non-coding RNAs are a major class of non-coding RNAs

  • receiver operating characteristic (ROC) like analyses were performed on the results of the following BLASTn searches: 1) mouse Long non-coding RNAs (lncRNA) against zebrafish phastCons elements, 2) shuffled mouse lncRNA sequences against zebrafish phastCons elements

  • Mouse candidates lncRNAs from two sources representing three datasets, were used to determine sequence conservation

Read more

Summary

Introduction

They are involved in diverse intra-cellular mechanisms like molecular scaffolding, splicing and DNA methylation Through these mechanisms they are reported to play a role in cellular differentiation and development. Long non-coding RNAs (lncRNAs) were firstly reported as transcripts expressed in large numbers in mammalian transcriptomes [1,2]. They were shown to constitute more than half of all the transcriptional outputs of mammalian genomes [3]. Prior to these reports Xist in mammals and Xlsirt in amphibians were the only well characterized lncRNAs described to function in Χ; chromosome inactivation and the formation of cytoskeleton [4,5]. There are examples of lncRNAs playing a role in the adaptive immunity of mammals [11], being differentially expressed in response to carcinogens [12] and functioning as enhancers [13]

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.