Accessing the SEED Genome Databases via Web Services API: Tools for Programmers

Terry Disz,Daniel Cuevas,Robert Olson,Ross Overbeek,Veronika Vonstein,Sajia Akhter,Robert A Edwards,Rick Stevens

doi:10.1186/1471-2105-11-319

Terry Disz, Daniel Cuevas + Show 6 more

Open Access

https://doi.org/10.1186/1471-2105-11-319

Copy DOI

Abstract

BackgroundThe SEED integrates many publicly available genome sequences into a single resource. The database contains accurate and up-to-date annotations based on the subsystems concept that leverages clustering between genomes and other clues to accurately and efficiently annotate microbial genomes. The backend is used as the foundation for many genome annotation tools, such as the Rapid Annotation using Subsystems Technology (RAST) server for whole genome annotation, the metagenomics RAST server for random community genome annotations, and the annotation clearinghouse for exchanging annotations from different resources. In addition to a web user interface, the SEED also provides Web services based API for programmatic access to the data in the SEED, allowing the development of third-party tools and mash-ups.ResultsThe currently exposed Web services encompass over forty different methods for accessing data related to microbial genome annotations. The Web services provide comprehensive access to the database back end, allowing any programmer access to the most consistent and accurate genome annotations available. The Web services are deployed using a platform independent service-oriented approach that allows the user to choose the most suitable programming platform for their application. Example code demonstrate that Web services can be used to access the SEED using common bioinformatics programming languages such as Perl, Python, and Java.ConclusionsWe present a novel approach to access the SEED database. Using Web services, a robust API for access to genomics data is provided, without requiring large volume downloads all at once. The API ensures timely access to the most current datasets available, including the new genomes as soon as they come online.

Highlights

The (The database and infrastructure for comparative genomics) (SEED) integrates many publicly available genome sequences into a single resource
The SEED currently contains over 850 Bacterial genomes that have been completely sequenced (Table 1; The SEED contains many hundreds of draft genomes.) For several years it has been realized that the most efficient and accurate way of annotating these genomes is not by considering each in isolation, but by comparing them all together in unified integration platforms [1]
The SEED platform provides the underpinnings to several common microbial genome annotation services (Fig.1)

Summary

Introduction

The SEED integrates many publicly available genome sequences into a single resource. The database contains accurate and up-to-date annotations based on the subsystems concept that leverages clustering between genomes and other clues to accurately and efficiently annotate microbial genomes. The SEED http://www.theseed.org/ contains all publicly available genome sequences. The Rapid Annotation using Subsystem Technology (RAST server) provides high throughput accurate annotations for complete microbial genomes [6,7]. The development of the RAST server for complete microbial genome annotation provides consistent and accurate annotations, automatic connections to metabolic reconstructions, and detailed comparative genomics tools pre-

Objectives

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: Jun 14, 2010
Citations: 134	License type: cc-by

R Discovery Prime

R Discovery Prime

Accessing the SEED Genome Databases via Web Services API: Tools for Programmers

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

High-quality genome sequence assembly of R.A73 Enterococcus faecium isolated from freshwater fish mucus
Rim El Jeni ... Balkiss Bouhaouala-Zahar
BMC microbiology | VOL. 20
Rim El Jeni, et. al.Rim El Jeni ... Balkiss Bouhaouala-Zahar
23 Oct 2020
BMC microbiology | VOL. 20

PHASTEST: faster than PHASTER, better than PHAST.
David S Wishart ... Sukanta Saha
Nucleic Acids Research | VOL. 51
David S Wishart, et. al.David S Wishart ... Sukanta Saha
17 May 2023
Nucleic Acids Research | VOL. 51

MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects
Carson Holt ... Mark Yandell
BMC Bioinformatics | VOL. 12
Carson Holt, et. al.Carson Holt ... Mark Yandell
01 Dec 2011
BMC Bioinformatics | VOL. 12

Genome Sequence Resource of Bacillus velezensis Strain HC-8, a Native Bacterial Endophyte with Biocontrol Potential Against the Honeysuckle Powdery Mildew Causative Pathogen Erysiphe lonicerae var. lonicerae.
Wenyan Cui ... Pengjie He
Molecular plant-microbe interactions : MPMI | VOL. 35
Wenyan Cui, et. al.Wenyan Cui ... Pengjie He
13 Jul 2022
Molecular plant-microbe interactions : MPMI | VOL. 35

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Accessing the SEED Genome Databases via Web Services API: Tools for Programmers

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics