Precision mapping of site-specific glycans using mass spectrometry is vital in glycoproteomics. However, the diversity of glycan compositions across species often exceeds database capacity, hindering the identification of rare glycans. Here, we introduce pGlycoNovo, a software within the pGlyco3 software environment, which employs a glycan first-based full-range Y-ion dynamic searching strategy. pGlycoNovo enables de novo identification of intact glycopeptides with rare glycans by considering all possible monosaccharide combinations, expanding the glycan search space to 16~1000 times compared to non-open search methods, while maintaining accuracy, sensitivity and speed. Reanalysis of SARS Covid-2 spike protein glycosylation data revealed 230 additional site-specific N-glycans and 30 previously unreported O-glycans. pGlycoNovo demonstrated high complementarity to six other tools and superior search speed. It enables characterization of site-specific N-glycosylation across five evolutionarily distant species, contributing to a dataset of 32,549 site-specific glycans on 4602 proteins, including 2409 site-specific rare glycans, and uncovering unexpected glycan fragments.
Read full abstract