Abstract

DeepResolution (Deep learning-assisted multivariate curve Resolution) has been proposed to solve the co-eluting problem for GC-MS data. However, DeepResolution models must be retrained when encountering unknown components, which is undoubtedly time-consuming and burdensome. In this study, a new pipeline named DeepResoution2 was proposed to overcome these limitations. DeepResolution2 utilizes deep neural networks to divide the profile into segments, estimate the number of components in each segment, and predict the elution region of each component. Subsequently, the information obtained by these deep learning models is used to assist the multivariate curve resolution procedure. Only seven models (1 + 1 + 5) are required to automate the whole analysis procedure of untargeted GC-MS data, which is an important improvement over DeepResolution. These seven models are stable and universal. Once established, they can be used to resolve most GC-MS data. Compared with MS-DIAL, ADAP-GC, and AMDIS, DeepResolution2 can obtain more reasonable mass spectra, chromatograms and peak areas to identify and quantify compounds. DeepResoution2 (0.955) outperformed AMDIS (0.939), MS-DIAL (0.948) and ADAP-GC (0.860) in terms of the linear correlation between concentrations and peak areas on overlapped peaks in fatty acid dataset. In real biological samples of human male infertility plasma, the peak areas and mass spectra of 136 untargeted GC-MS files were automatically extracted by DeepResolution2 without any prior information and manual intervention. DeepResolution2 includes all the functions for analyzing untargeted GC-MS datasets from the feature extraction of raw data files to the establishment of discriminant models.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call