Abstract

The attachment of N-acetylglucosamine to serine or threonine residues (O-GlcNAc) is a post-translational modification on nuclear and cytoplasmic proteins with emerging roles in numerous cellular processes, such as signal transduction, transcription, and translation. It is further presumed that O-GlcNAc can exhibit a site-specific, dynamic and possibly functional interplay with phosphorylation. O-GlcNAc proteins are commonly identified by tandem mass spectrometry following some form of biochemical enrichment. In the present study, we assessed if, and to which extent, O-GlcNAc-modified proteins can be discovered from existing large-scale proteome data sets. To this end, we conceived a straightforward O-GlcNAc identification strategy based on our recently developed Oscore software that automatically analyzes tandem mass spectra for the presence and intensity of O-GlcNAc diagnostic fragment ions. Using the Oscore, we discovered hundreds of O-GlcNAc peptides not initially identified in these studies, and most of which have not been described before. Merely re-searching this data extended the number of known O-GlcNAc proteins by almost 100 suggesting that this modification exists even more widely than previously anticipated and the modification is often sufficiently abundant to be detected without enrichment. However, a comparison of O-GlcNAc and phospho-identifications from the very same data indicates that the O-GlcNAc modification is considerably less abundant than phosphorylation. The discovery of numerous doubly modified peptides (i.e. peptides with one or multiple O-GlcNAc or phosphate moieties), suggests that O-GlcNAc and phosphorylation are not necessarily mutually exclusive, but can occur simultaneously at adjacent sites.

Highlights

  • The modification of proteins with N-acetylglucosamine (OGlcNAc)1 is an emerging dynamic post-translational modification of serine or threonine residues of proteins

  • Oscore-based O-GlcNAc Protein Identification Strategy— We recently developed the Oscore as a means to assess the probability of a tandem MS spectrum to represent an OGlcNAc modified peptide [12]

  • The first data set comprises the label-free comparison of 11 commonly used cell lines [14]; the second data set comprises a comprehensive characterization of the HeLa cancer cell line proteome employing multiple protease digestion [15], and the third data set represents an iTRAQ-based quantitative comparison of the proteome and the phospho-proteome of four human embryonic stem cell lines and four induced pluripotent stem cell lines [16]

Read more

Summary

EXPERIMENTAL PROCEDURES

Available Data—Publically available raw mass spectrometric data from published proteome-wide studies of 11 different cell lines [14], HeLa cells [15], as well as data from published proteomewide and phospho-proteome studies of hES and iPS cells [16] were downloaded from respective repositories (see supplemental Table S1). The resulting peak list files were processed by the Oscore perl script, which calculates the Oscore for every peptide precursor for which the tandem MS spectrum contains at least one diagnostic O-GlcNAc feature within a tolerance of 10 ppm. The peak list files were searched with Mascot 2.3.0 against the UniProtKB complete human (download date 26.10.2010, 110,550 sequences) combined with sequences of common contaminants. In case of the phospho-proteome dataset of hES and iPS cells [16], the spectra were searched against a subset database generated with Scaffold 3.3.1 (Proteome Software, Portland, OR) including only protein identifications from the respective full proteome data set (11,288 sequences). The targetdecoy option of Mascot was enabled and peptide mass tolerance was set to 10 ppm and fragment mass tolerance to 0.02 Da. Search results were imported into Scaffold 3.3.1. The Oscore script is available from www.wzw.tum.de/proteomics/ content/research/software/; and the peaklist files for all processed data can be downloaded from ProteomeCommons.org Tranche using the following hash key: ChunHqKHVaLCoocgKoyBjphK1QntOh6ehU0MzuLgwfϩFZHjEfAnt Iyz zY38 Rv05 1iV NoNF NJ QHi bLYJl 4dDRotCm 1UA AA AAA AAEpgϭϭ(passphrase: sa3sh7mgcf6eolskt57p)

RESULTS AND DISCUSSION
Total ϩ phospho
CONCLUSIONS
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.