An XIC-Centric Strategy for Improved Identification and Quantification in Proteomic Data Analyses.

Guanghui Wang,Sergey L Sheetlin,Stephen E Stein,Yi Liu,Meghan C Burke,Zheng Zhang

doi:10.1021/acs.jproteome.3c00633

Abstract

Reproducibility is a "proteomic dream" yet to be fully realized. A typical data analysis workflow utilizing extracted ion chromatograms (XICs) often treats the information path from identification to quantification as a one-way street. Here, we propose an XIC-centric approach in which the data flow is bidirectional: identifications are used to derive XICs whose information is in turn applied to validate the identifications. In this study, we employed liquid chromatography-mass spectrometry data from glycoprotein and human hair samples to illustrate the XIC-centric concept. At the core of this approach was XIC-based monoisotope repicking. Taking advantage of the intensity information for all detected isotopes across the whole range of an XIC peak significantly improved the accuracy and uncovered misidentifications originating from monoisotope assignment mistakes. It could also rescue non-top-ranked glycopeptide hits. Identification of glycopeptides is particularly susceptible to precursor mass errors for their low abundances, large masses, and glycans differing by 1 or 2 Da easily confused as isotopes. In addition, the XIC-centric strategy significantly reduced the problem of one XIC peak associated with multiple unique identifications, a source of quantitative irreproducibility. Taken together, the proposed approach can lead to improved identification and quantification accuracy and, ultimately, enhanced reproducibility in proteomic data analyses.

Full Text