Identifying novel genes in C. elegans using SAGE tags

Matthew J Nesbitt,Nansheng Chen,Donald G Moerman

doi:10.1186/1471-2199-11-96

Matthew J Nesbitt, Nansheng Chen + Show 1 more

Open Access

PDF Available

https://doi.org/10.1186/1471-2199-11-96

Copy DOI

Export

Save

Cite

Abstract
Highlights/Summary
Full-Text PDF
Similar Papers

Abstract

Listen

BackgroundDespite extensive efforts devoted to predicting protein-coding genes in genome sequences, many bona fide genes have not been found and many existing gene models are not accurate in all sequenced eukaryote genomes. This situation is partly explained by the fact that gene prediction programs have been developed based on our incomplete understanding of gene feature information such as splicing and promoter characteristics. Additionally, full-length cDNAs of many genes and their isoforms are hard to obtain due to their low level or rare expression. In order to obtain full-length sequences of all protein-coding genes, alternative approaches are required.ResultsIn this project, we have developed a method of reconstructing full-length cDNA sequences based on short expressed sequence tags which is called sequence tag-based amplification of cDNA ends (STACE). Expressed tags are used as anchors for retrieving full-length transcripts in two rounds of PCR amplification. We have demonstrated the application of STACE in reconstructing full-length cDNA sequences using expressed tags mined in an array of serial analysis of gene expression (SAGE) of C. elegans cDNA libraries. We have successfully applied STACE to recover sequence information for 12 genes, for two of which we found isoforms. STACE was used to successfully recover full-length cDNA sequences for seven of these genes.ConclusionsThe STACE method can be used to effectively reconstruct full-length cDNA sequences of genes that are under-represented in cDNA sequencing projects and have been missed by existing gene prediction methods, but their existence has been suggested by short sequence tags such as SAGE tags.

Highlights

Despite extensive efforts devoted to predicting protein-coding genes in genome sequences, many bona fide genes have not been found and many existing gene models are not accurate in all sequenced eukaryote genomes
While serial analysis of gene expression (SAGE) tags that correspond to existing gene models can be used to evaluate the abundance of gene expression, there are a large number of SAGE tags that do not correspond to existing gene models
Tag based reconstruction of full-length cDNA sequence of novel genes Expressed sequence tags that cannot be aligned to the C. elegans virtual transcriptome suggest the existence of yet unannotated genes [13,21]

Summary

Introduction

Despite extensive efforts devoted to predicting protein-coding genes in genome sequences, many bona fide genes have not been found and many existing gene models are not accurate in all sequenced eukaryote genomes. The C. elegans gene set is still far from complete for the following reasons: First, because Genefinder, like other gene prediction programs, was developed based on an incomplete understanding of gene structures, it suffers from both false. In this project, we explored how to reconstruct fulllength gene models for genes that are not correctly represented in the current gene set, using expressed sequence tags obtained in large-scale gene expression projects. These SAGE tags suggest the existence of additional coding exons, splice variants [20], or novel genes

Methods

Results

Discussion

Conclusion

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Molecular Biology	Publication Date: Dec 1, 2010
Citations: 6	License type: CC BY 4.0

R Discovery Prime

Identifying novel genes in C. elegans using SAGE tags

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: BMC Molecular Biology

Lead the way for us

Similar Papers

Modified PCR methods for 3′ end amplification from serial analysis of gene expression (SAGE) tags
Wang‐Jie Xu ... Zhong‐Dong Qiao
The FEBS Journal | VOL. 276
Wang‐Jie Xu, et. al.Wang‐Jie Xu ... Zhong‐Dong Qiao
27 Apr 2009
The FEBS Journal | VOL. 276

Differentially expressed genes in pancreatic ductal adenocarcinomas identified through serial analysis of gene expression
Steven R Hustinx ... Ralph H Hruban
Cancer Biology & Therapy | VOL. 3
Steven R Hustinx, et. al.Steven R Hustinx ... Ralph H Hruban
01 Dec 2004
Cancer Biology & Therapy | VOL. 3

Comparative serial analysis of gene expression of transcript profiles of tomato roots infected with cyst nematode
Taketo Uehara ... Chikara Masuta
Plant Molecular Biology | VOL. 63
Taketo Uehara, et. al.Taketo Uehara ... Chikara Masuta
16 Sep 2006
Plant Molecular Biology | VOL. 63

Generation of longer cDNA fragments from serial analysis of gene expression tags for gene identification.
Jian-Jun Chen ... San Ming Wang
Proceedings of the National Academy of Sciences | VOL. 97
Jian-Jun Chen, et. al.Jian-Jun Chen ... San Ming Wang
04 Jan 2000
Proceedings of the National Academy of Sciences | VOL. 97

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Identifying novel genes in C. elegans using SAGE tags

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: BMC Molecular Biology