Re-annotation of the Saccharopolyspora erythraea genome using a systems biology approach

Esteban Marcellin, Tim R Mercer, Robin W Palfreyman, Cuauhtemoc Licona-Cassani, Lars K Nielsen

doi:10.1186/1471-2164-14-699

Open Access

https://doi.org/10.1186/1471-2164-14-699

Journal: BMC Genomics
Publication Date: Jan 1, 2013
Citations: 64
License type: cc-by

Affiliation: University of Queensland

Abstract

Background: Accurate bacterial genome annotations provide a framework for understanding cellular functions, behavior and pathogenicity and are essential for metabolic engineering. Annotations based only on in silico predictions are inaccurate, particularly for large, high G + C content genomes, due to the lack of similarities in gene length and gene organization to model organisms.

Results: Here we describe a 2D systems biology driven re-annotation of the Saccharopolyspora erythraea genome using proteogenomics, a genome-scale metabolic reconstruction, RNA-sequencing and small-RNA-sequencing. We observed transcription of more than 300 intergenic regions, detected 59 peptides in intergenic regions, confirmed 164 open reading frames previously annotated as hypothetical proteins and reassigned function to open reading frames using the genome-scale metabolic reconstruction. Finally, we present a novel way of mapping ribosomal binding sites across the genome by sequencing small RNAs.

Conclusions: The work presented here describes a novel framework for annotation of the Saccharopolyspora erythraea genome. Based on experimental observations, the 2D annotation framework greatly reduces errors that are commonly made when annotating large, high G + C content genomes using computational prediction algorithms.

Highlights

Accurate bacterial genome annotations provide a framework for understanding cellular functions, behavior and pathogenicity and are essential for metabolic engineering
Nielsen et al. found that 60% of annotated bacterial genomes contain substantial errors in start/stop codon predictions and are generally over-annotated due to a lack of thorough analysis between computationally assigned open reading frames (ORFs) and real genes [7]
Errors in annotation are abundant in large, high G + C content genomes, where gene length and gene organization vary significantly from well-annotated model organisms such as Escherichia coli, Saccharomyces cerevisiae or Bacillus subtilis

Summary

Introduction

Accurate bacterial genome annotations provide a framework for understanding cellular functions, behavior and pathogenicity and are essential for metabolic engineering. Nielsen et al. found that 60% of annotated bacterial genomes contain substantial errors in start/stop codon predictions and are generally over-annotated due to a lack of thorough analysis between computationally assigned open reading frames (ORFs) and real genes [7]. This observation has been acknowledged by the National Center for Biotechnology Information (NCBI), which is constantly developing. These long ORFs contain a large number of potential start codons that lead to a considerable drop in the accuracy of translation initiation site prediction and to the prediction of too many genes [9].
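To make the start-codon ambiguity concrete, the following minimal Python sketch counts the in-frame candidate start codons inside a single open reading frame and reports its G + C fraction. It is an illustration only, not the authors' pipeline: the function names, the toy sequence and the choice of ATG, GTG and TTG as candidate start codons are assumptions made for this example.

```python
from typing import List

# Assumed for this sketch: the three start codons most commonly seen in
# bacteria. The source does not state which codons were considered.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty string)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def in_frame_starts(seq: str) -> List[int]:
    """Return offsets of candidate start codons read in frame 0.

    Scanning stops at the first in-frame stop codon, so every offset
    returned is a possible translation initiation site for the same ORF.
    """
    seq = seq.upper()
    starts = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if codon in STOP_CODONS:
            break
        if codon in START_CODONS:
            starts.append(i)
    return starts


if __name__ == "__main__":
    # Hypothetical toy ORF: GTG GCC ATG CGC TTG GCG TGA
    toy_orf = "GTGGCCATGCGCTTGGCGTGA"
    print("candidate start offsets:", in_frame_starts(toy_orf))
    print("G + C fraction:", round(gc_content(toy_orf), 3))
```

Even in this seven-codon toy sequence, three different offsets could serve as the start of the same reading frame. A longer ORF offers correspondingly more candidates, which is why experimental evidence such as detected peptides or mapped ribosomal binding sites helps to choose among them.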

Similar Papers

Modeling and analysis of flux distribution and bioproduct formation in Synechocystis sp. PCC 6803 using a new genome-scale metabolic reconstruction
Chintan J Joshi ... Ashok Prasad
Algal Research | VOL. 27
30 Sep 2017

Genomewide analysis of nucleosome density histone acetylation and HDAC function in fission yeast
Marianna Wirén ... Karl Ekwall
The EMBO Journal | VOL. 24
04 Aug 2005

An Educational Bioinformatics Project to Improve Genome Annotation.
Zoie Amatore ... Susan Gunn
Frontiers in Microbiology | VOL. 11
07 Dec 2020

A workflow to identify novel proteins based on the direct mapping of peptide-spectrum-matches to genomic locations
John Anders ... Nico Jehmlich
BMC Bioinformatics | VOL. 22
26 May 2021