Abstract

BackgroundA major effort of the scientific community has been to obtain complete pictures of the genomes of many organisms. This has been accomplished mainly by annotation of structural and functional elements in the genome sequence, a process that has been centred in the gene concept and, as a consequence, biased toward protein coding sequences. Recently, the explosion of transcriptome data generated and the discovery of many functional non-protein coding RNAs have painted a more detailed and complex scenario for the genome. Here we analyzed the mouse carboxypeptidase M locus in this broader perspective in order to define the mouse CPM gene structure and evaluate the existence of other transcripts from the same genomic region.ResultsBioinformatic analysis of nucleotide sequences that map to the mouse CPM locus suggests that, in addition to the mouse CPM mRNA, it expresses at least 33 different transcripts, many of which seem to be non-coding RNAs. We randomly chose to evaluate experimentally four of these extra transcripts. They are expressed in a tissue specific manner, indicating that they are not artefacts or transcriptional noise. Furthermore, one of these four extra transcripts shows expression patterns that differed considerably from the other ones and from the mouse CPM gene, suggesting that there may be more than one transcriptional unit in this locus. In addition, we have confirmed the mouse CPM gene RefSeq sequence by rapid amplification of cDNA ends (RACE) and directional cloning.ConclusionThis study supports the recent view that the majority of the genome is transcribed and that many of the resulting transcripts seem to be non-coding RNAs from introns of genes or from independent transcriptional units. Although some of the information on the transcriptome of many organisms may actually be artefacts or transcriptional noise, we argue that it can be experimentally evaluated and used to find and define biological functional elements on the genome. Furthermore, the transcription of other functional RNAs besides the protein coding RNA from a specific genomic locus imposes extra care when designing and interpreting experiments involving genetic manipulations or expression detection and quantification.

Highlights

  • A major effort of the scientific community has been to obtain complete pictures of the genomes of many organisms

  • Bioinformatic analysis of the mouse carboxypeptidase M (CPM) locus and its transcripts The mouse CPM locus is located in the 10D2 region of the mouse chromosome 10, between the Mdm2 gene [GenBank – Gene ID: 17246] and the annotated mRNA for kinesinrelated protein KIFC5C [GenBank: AF221104], which is a possible pseudogene of the kinesin family member C1 [GenBank – GeneID: 16580] that maps to the intergenic region between the Cpsf6 [GenBank – GeneID: 432508] and CPM gene at about 35 kb upstream from the 5'end of the CPM gene (Figure 1A)

  • In addition to the CPM gene, we defined at least 33 other possible mouse CPM locus Transcripts after analysing a total of 237 sequences that maps to this locus (Figure 1A): 38 CAGE-tags and 199 cDNA sequences between mRNAs and ESTs from the mouse CPM GenBank – UniGene [22] entry Mm.339332 and other cDNA sequences deposited in the GenBank

Read more

Summary

Introduction

A major effort of the scientific community has been to obtain complete pictures of the genomes of many organisms This has been accomplished mainly by annotation of structural and functional elements in the genome sequence, a process that has been centred in the gene concept and, as a consequence, biased toward protein coding sequences. It is believed that this enzyme plays important roles in the processing of many peptide hormones, especially during inflammation and macrophage activation [4,5]. Both CPM activity and protein levels greatly increase in response to inflammatory stimuli [6,7]. The same experimental strategy has not been implemented to resolve the mouse CPM gene [GenBank – GeneID: 70574], it has been defined and annotated on the mouse chromosome 10 mainly on basis of an in silico analysis of the mouse transcriptome and genome using the nucleotide sequence data on the public databases as well as in comparison with the human CPM gene sequence

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call