The Proteins API: accessing key integrated protein and genome information.

Andrew Nightingale,Maria Martin,Emanuele Alpi,Guoying Qi,Edd Turner,Jie Luo,Leonardo Gonzales,Ricardo Antunes,Borisas Bursteinas,Wudong Liu

doi:10.1093/nar/gkx237

Andrew Nightingale, Maria Martin + Show 8 more

Open Access

https://doi.org/10.1093/nar/gkx237

Copy DOI

Journal: Nucleic Acids Research	Publication Date: Apr 5, 2017
Citations: 74	License type: CC BY 4.0

Affiliation: European Bioinformatics Institute

Abstract

The Proteins API provides searching and programmatic access to protein and associated genomics data such as curated protein sequence positional annotations from UniProtKB, as well as mapped variation and proteomics data from large scale data sources (LSS). Using the coordinates service, researchers are able to retrieve the genomic sequence coordinates for proteins in UniProtKB. This, the LSS genomics and proteomics data for UniProt proteins is programmatically only available through this service. A Swagger UI has been implemented to provide documentation, an interface for users, with little or no programming experience, to ‘talk’ to the services to quickly and easily formulate queries with the services and obtain dynamically generated source code for popular programming languages, such as Java, Perl, Python and Ruby. Search results are returned as standard JSON, XML or GFF data objects. The Proteins API is a scalable, reliable, fast, easy to use RESTful services that provides a broad protein information resource for users to ask questions based upon their field of expertise and allowing them to gain an integrated overview of protein annotations available to aid their knowledge gain on proteins in biological processes. The Proteins API is available at (http://www.ebi.ac.uk/proteins/api/doc).

Highlights

Discovering and understanding biological processes and diseases can be enormously cumbersome, requiring the integration and analysis of a large number of observations from world-wide produced experimental data and information collected and curated in biological resources; having access to resources that collate and interpret biological data as meaningful meta information is essential for the scientific discovery process.The Universal Protein Resource (UniProt) [1] is a comprehensive resource for protein sequence and annotation data
The UniProt Knowledgebase (UniProtKB) annotations provides detailed sequence positional functional information of protein entries along with cross-references to over 150 databases acting as a central hub of protein information
UniProt collaborates with other bioinformatics resources, such as genomics resources Ensembl [2] and ClinVar [3] and proteomics resources PRIDE [4], PeptideAtlas [5,6] and MaxQB [7] to provide mappings between the resources and the large scale experimental data sets they provide

Summary

Introduction

Discovering and understanding biological processes and diseases can be enormously cumbersome, requiring the integration and analysis of a large number of observations from world-wide produced experimental data and information collected and curated in biological resources; having access to resources that collate and interpret biological data as meaningful meta information is essential for the scientific discovery process.The Universal Protein Resource (UniProt) [1] is a comprehensive resource for protein sequence and annotation data. We have developed the Proteins API, a REST web service, to provide programmatic access to protein sequence information and additional resources such as genomic coordinates mapping, antibody antigen sequences and mapped proteomics sequencing peptides; to enable researchers to visualize and integrate a broader range of biological data in to their analyses.

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

The Proteins API: accessing key integrated protein and genome information.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Nucleic Acids Research

Lead the way for us

Similar Papers

From chemoproteomic-detected amino acids to genomic coordinates: insights into precise multi-omic data integration.
Maria F Palafox ... Heta S Desai
Molecular Systems Biology | VOL. 17
Maria F Palafox, et. al.Maria F Palafox ... Heta S Desai
01 Feb 2021
Molecular Systems Biology | VOL. 17

Towards visual analytics for digging into human rights violations data
-
Proceedings of the American Society for Information Science and Technology | VOL. 51
--
01 Jan 2014
Proceedings of the American Society for Information Science and Technology | VOL. 51

Automatic accuracy assessment via hashing in multiple-source environment
Jingyu Han ... Lingjuan Li
Expert Systems With Applications | VOL. 37
Jingyu Han, et. al.Jingyu Han ... Lingjuan Li
26 Aug 2009
Expert Systems With Applications | VOL. 37

Processing Shotgun Proteomics Data on the Amazon Cloud with the Trans-Proteomic Pipeline
Joseph Slagel ... Robert L Moritz
Molecular & Cellular Proteomics | VOL. 14
Joseph Slagel, et. al.Joseph Slagel ... Robert L Moritz
01 Feb 2015
Molecular & Cellular Proteomics | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The Proteins API: accessing key integrated protein and genome information.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Nucleic Acids Research