Glycosylation is one of the most common post-translational modifications (PTMs). Protein glycosylation analysis is the bottleneck to deeply understand their functions. At present, the LC-MS analysis of glycosylated post-translational modification is mainly focused on the analysis of glycopeptides. However, the factors affecting the identification of glycopeptides were not fully elucidated. In the paper, we have carefully studied the factors, e.g., HILIC materials, search engines, protein amount, gradient duration, extraction solution, etc. According to the results, HILIC materials were the most important factors affecting the glycopeptides identification, and the amphoteric sulfoalkyl betaine stationary phase enriched glycopeptides 6-fold more compared to the amphiphilic ion-bonded fully porous spherical silica stationary phase. We explored the influence of the extraction solutions on glycan identification. Comparing sodium dodecyl sulfate (SDS) and urea (UA), the results showed that N-glycolylneuraminic acid (NeuGc) type of glycan content was found to be increased 1.4-fold in the SDS compared to UA. Besides, we explored the influence of the search engine on glycopeptide identification. Comparing pGlyco3.0 and MSFragger-Glyco, it was observed that pGlyco3.0 outperformed MSFragger-Glyco in identifying glycopeptides. Then, using our optimized method we found that there was a significant difference in the distribution of monosaccharide types in plasma and brain tissue, e.g., the content of NeuAc in brain was 5-fold higher than that in plasma. To importantly, two glycoproteins (Neurexin-2 and SUN domain-containing protein 2) were also found for the first time by our method. In summary, we have comprehensively studied the factors influencing glycopeptide identification than any previous research, and the optimized method could be widely used for identifying the glycoproteins or glycolpeptides biomarkers for disease detection and therapeutic targets.
Read full abstract