Abstract

AbstractQuantitative structure‐property relationships (QSPRs) have been developed to predict the ion mobility spectrometry (IMS) drift time tD for a set of 1481 peptides generated by protease digestion of the Drosophila melanogaster proteome using information directly derived from molecular structures. The relationship between peptide structure and the drift time tD was constructed by using partial least squares regression (PLS), least‐squares support vector machine (LSSVM) and Gaussian process (GP) coupled with genetic algorithm‐variable selection. Among these models, the linear PLS was incapable of capturing all dependences in this peptide system; nonlinear LSSVM and GP methods presented a good statistical performance on reproducing peptide mobility behavior. Moreover, since GP was able to handling both linear and nonlinear‐hybrid relationship, it gave a stronger fitting ability and a better predictive power than the LSSVM. Systematic analysis of the GP model showed that diversified properties contribute remarkable effect to the relationship between the drift time and the peptide structure. Particularly, the structural topological information and charge distribution contribute significantly to the drift time of peptides.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call