Abstract

The peak shifts may lead to an incorrect statistical result for nontargeted metabolomics profiling, such as classification and discrimination in pattern recognition. In the paper, a more accurate alignment algorithm is developed based on Subwindow Factor Analysis and Mass Spectral information (SFA-MS). Compared with other methods, this new algorithm aligns the peaks more accurately without changing their shapes, especially for the overlapping peak clusters. To begin, the Continuous Wavelet Transform with Haar wavelet as the mother wavelet (Haar CWT) is used to determine the position and width of peaks. On this basis, the candidate drift points are confirmed by Fast Fourier Transform (FFT) cross correlation. Furthermore, the MS fitting degree of the common components between the reference chromatogram and the raw chromatogram is determined by the Subwindow Factor Analysis (SFA). When the MS information between reference and raw peaks is identical, the corresponding moving points are the optimum shifts. It is remarkable that all the peaks are moved through linear interpolation in the non-peak parts, so that the aligned chromatograms remain unchanged. The SFA-MS algorithm was implemented in the Matlab language and is available as an open source package.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call