ProMEX: a mass spectral reference database for proteins and protein phosphorylation sites.

Jan Hummel, Stefanie Wienkoop, Wolfram Weckwerth, Dirk Steinhauser, Waltraud Schulze, Michaela Niemann, Joachim Selbig, Dirk Walther

doi:10.1186/1471-2105-8-216


Open Access

https://doi.org/10.1186/1471-2105-8-216


Abstract

Background

In the last decade, techniques have been established for the large-scale, genome-wide analysis of proteins, RNA, and metabolites, and database solutions have been developed to manage the generated data sets. The Golm Metabolome Database for metabolite data (GMD) represents one such effort to make these data broadly available and to interconnect the different molecular levels of a biological system [1]. As interpreting new data in the light of existing data becomes increasingly important, these initiatives are an essential part of current and future systems biology.

Results

A mass spectral library consisting of experimentally derived tryptic peptide product ion spectra was generated using liquid chromatography coupled to ion trap mass spectrometry (LC-IT-MS). Protein samples derived from Arabidopsis thaliana, Chlamydomonas reinhardtii, Medicago truncatula, and Sinorhizobium meliloti were analysed. With currently 4,557 manually validated spectra associated with 4,226 unique peptides from 1,367 proteins, the database serves as a continuously growing reference data set and can be used for protein identification and quantification in uncharacterized biological samples. For peptide identification, several algorithms were implemented based on a recently published study on peptide mass fingerprinting [2] and tested for false positive and false negative rates. An algorithm that considers the intensity distribution when computing match correlation scores was found to yield the best results. As a proof of concept, an LC-IT-MS analysis of a tryptic leaf protein digest was converted to mzData format and searched against the mass spectral library. The utility of the mass spectral library was also tested for the identification of phosphorylated tryptic peptides. We included in vivo phosphorylation sites of Arabidopsis thaliana proteins, and identification performance was found to be improved compared to genome-based search algorithms. Protein identification by ProMEX is linked to other levels of biological organization such as metabolite, pathway, and transcript data. The database is further connected to annotation and classification services via BioMoby.

Conclusion

The ProMEX protein/peptide database represents a mass spectral reference library with the capability of matching unknown samples for protein identification. The database allows text searches based on metadata such as experimental information on the samples, mass spectrometric instrument parameters, or unique protein identifiers such as AGI codes. ProMEX integrates proteomics data with other levels of molecular organization, including metabolite, pathway, and transcript information, and may thus become a useful resource for plant systems biology studies. The ProMEX mass spectral library is available at .

Highlights

In the last decade, techniques have been established for the large-scale, genome-wide analysis of proteins, RNA, and metabolites, and database solutions have been developed to manage the generated data sets.
The database allows text searches based on metadata such as experimental information on the samples, mass spectrometric instrument parameters, or unique protein identifiers such as Arabidopsis Genome Initiative (AGI) codes.
We investigate here the utility of a mass spectral reference library by implementing a database of 4,557 manually validated tryptic peptide product ion spectra of diverse plant proteins, generated by LC-ion trap mass spectrometry (LC-IT-MS), together with a mass fragment intensity correlation search for protein identification.

Summary

Introduction

Techniques have been established for the large-scale, genome-wide analysis of proteins, RNA, and metabolites, and database solutions have been developed to manage the generated data sets. Peptide fragmentation in an ion trap mass spectrometer (IT-MS) is one of the most widely used approaches for protein identification in complex samples [3]. One such technique, referred to as "shotgun proteomics", can be exploited for rapid screening and, in combination with further fractionation, for comprehensive qualitative protein identification in complex samples [4,5]. We investigate here the utility of a mass spectral reference library by implementing a database of 4,557 manually validated tryptic peptide product ion spectra of diverse plant proteins, generated by LC-IT-MS, together with a mass fragment intensity correlation search for protein identification.
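To make the idea of a mass fragment intensity correlation search concrete, the following is a minimal sketch in Python. It is not the ProMEX implementation: the unit-width m/z binning, the use of a Pearson correlation as the match score, the precursor tolerance, and all function names, field names, and peptide labels are assumptions chosen for illustration only.

```python
from math import sqrt


def bin_spectrum(peaks, bin_width=1.0):
    """Sum peak intensities into fixed-width m/z bins (assumed binning scheme)."""
    binned = {}
    for mz, intensity in peaks:
        idx = int(mz // bin_width)
        binned[idx] = binned.get(idx, 0.0) + intensity
    return binned


def correlation_score(query_peaks, reference_peaks, bin_width=1.0):
    """Pearson correlation of binned intensities over the union of occupied bins."""
    q = bin_spectrum(query_peaks, bin_width)
    r = bin_spectrum(reference_peaks, bin_width)
    bins = sorted(set(q) | set(r))
    if len(bins) < 2:
        return 0.0
    qv = [q.get(b, 0.0) for b in bins]
    rv = [r.get(b, 0.0) for b in bins]
    q_mean = sum(qv) / len(qv)
    r_mean = sum(rv) / len(rv)
    cov = sum((a - q_mean) * (b - r_mean) for a, b in zip(qv, rv))
    q_var = sum((a - q_mean) ** 2 for a in qv)
    r_var = sum((b - r_mean) ** 2 for b in rv)
    if q_var == 0.0 or r_var == 0.0:
        return 0.0
    return cov / sqrt(q_var * r_var)


def search_library(query_peaks, query_precursor_mz, library, precursor_tol=2.0):
    """Score library entries whose precursor m/z lies within the tolerance.

    Returns (score, peptide) pairs, best match first.
    """
    hits = []
    for entry in library:
        if abs(entry["precursor_mz"] - query_precursor_mz) <= precursor_tol:
            score = correlation_score(query_peaks, entry["peaks"])
            hits.append((score, entry["peptide"]))
    return sorted(hits, reverse=True)


# Hypothetical toy library: peptide labels, precursor m/z, and peaks are made up.
library = [
    {"peptide": "PEPTIDEK", "precursor_mz": 465.2,
     "peaks": [(175.1, 30.0), (304.2, 100.0), (433.2, 55.0)]},
    {"peptide": "SAMPLER", "precursor_mz": 466.0,
     "peaks": [(147.1, 80.0), (304.2, 10.0), (520.3, 100.0)]},
    {"peptide": "TESTK", "precursor_mz": 610.3,
     "peaks": [(175.1, 30.0), (304.2, 100.0), (433.2, 55.0)]},
]

query = [(175.1, 30.0), (304.2, 100.0), (433.2, 55.0)]
hits = search_library(query, 465.3, library)
```

In this sketch a candidate is first filtered by precursor m/z and only then scored, so the third library entry is never compared even though its peaks match the query. Because the score depends on relative intensities and not just on the presence of fragment masses, a reference spectrum that shares a fragment mass but has a different intensity pattern ranks lower.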



Journal: BMC Bioinformatics	Publication Date: Jun 23, 2007
Citations: 112	License type: cc-by


Similar Papers

On the inter‐instrument and the inter‐laboratory transferability of a tandem mass spectral reference library. 3. Focus on ion trap and upfront CID
Herbert Oberacher ... Sylvia Grosse
Journal of Mass Spectrometry | VOL. 47 | 01 Feb 2012

Building and Searching Tandem Mass Spectral Libraries for Peptide Identification
Henry Lam
Molecular & Cellular Proteomics | VOL. 10 | 06 Sep 2011

Mass spectral libraries: A statistical review of the visible use
Boris L Milman ... Inna K Zhurkovich
TrAC Trends in Analytical Chemistry | VOL. 80 | 04 May 2016

HAMMER: automated operation of mass frontier to construct in silico mass spectral fragmentation libraries
Jiarui Zhou ... Ralf J M Weber
Bioinformatics | VOL. 30 | 11 Dec 2013
