Abstract
Finding gene-specific peptides by mass spectrometry analysis to pinpoint gene loci responsible for particular protein products is a major challenge in proteomics especially in highly conserved gene families in higher eukaryotes. We used a combination of in silico approaches coupled to mass spectrometry analysis to advance the proteomics insight into Arabidopsis cytosolic ribosomal composition and its post-translational modifications. In silico digestion of all 409 ribosomal protein sequences in Arabidopsis defined the proportion of theoretical gene-specific peptides for each gene family and highlighted the need for low m/z cutoffs of MS ion selection for MS/MS to characterize low molecular weight, highly basic ribosomal proteins. We undertook an extensive MS/MS survey of the cytosolic ribosome using trypsin and, when required, chymotrypsin and pepsin. We then used custom software to extract and filter peptide match information from Mascot result files and implement high confidence criteria for calling gene-specific identifications based on the highest quality unambiguous spectra matching exclusively to certain in silico predicted gene- or gene family-specific peptides. This provided an in-depth analysis of the protein composition based on 1446 high quality MS/MS spectra matching to 795 peptide sequences from ribosomal proteins. These identified peptides from five gene families of ribosomal proteins not identified previously, providing experimental data on 79 of the 80 different types of ribosomal subunits. We provide strong evidence for gene-specific identification of 87 different ribosomal proteins from these 79 families. We also provide new information on 30 specific sites of co- and post-translational modification of ribosomal proteins in Arabidopsis by initiator methionine removal, N-terminal acetylation, N-terminal methylation, lysine N-methylation, and phosphorylation. These site-specific modification data provide a wealth of resources for further assessment of the role of ribosome modification in influencing translation in Arabidopsis.
Highlights
Finding gene-specific peptides by mass spectrometry analysis to pinpoint gene loci responsible for particular protein products is a major challenge in proteomics especially in highly conserved gene families in higher eukaryotes
In Silico Digestion to Develop Strategies for Analysis of the Arabidopsis Ribosomal Proteome—To develop strategies to identify ribosomal proteins from the cytosolic ribosome with confidence based on peptide MS/MS, we undertook a wide survey of the predicted ribosomal protein sequences from Arabidopsis
Processing of Proteogest output using custom PHP scripts produced a table for each enzyme containing the entire list of peptide sequences theoretically produced and for each peptide the gene(s) and gene family(s) from which it could be derived and a list of those peptides that were predicted to come from only one of the 409 protein sequences
Summary
Finding gene-specific peptides by mass spectrometry analysis to pinpoint gene loci responsible for particular protein products is a major challenge in proteomics especially in highly conserved gene families in higher eukaryotes. We used custom software to extract and filter peptide match information from Mascot result files and implement high confidence criteria for calling gene-specific identifications based on the highest quality unambiguous spectra matching exclusively to certain in silico predicted gene- or gene family-specific peptides This provided an in-depth analysis of the protein composition based on 1446 high quality MS/MS spectra matching to 795 peptide sequences from ribosomal proteins. We provide new information on 30 specific sites of co- and post-translational modification of ribosomal proteins in Arabidopsis by initiator methionine removal, N-terminal acetylation, N-terminal methylation, lysine N-methylation, and phosphorylation. Rat, and human cytosolic ribosomes by peptide mass spectrometry have provided insights into the composition and post-translational modification of ribosomal proteins [3,4,5,6,7]. Counting of distinct protein spots on these gels suggested that the 40 S subunit contained up to 40 proteins, whereas the 60 S subunit contained up to 59 proteins; without genomic sequences it was not possible to systematically assign these proteins to genes and gene families that would be required to resolve the composition of the complex in plants in a genespecific manner
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.