ABSTRACT It is urgent to identify and screen emerging pollutants (EPs), which have caused great harm to human health and the environment. In their detection of liquid chromatography-mass spectrometry (LC-MS), the quantitative structure–retention relationship (QSRR) model is simple and efficient to predict the retention behavior of compounds. In the present work, we collected more data with the relative retention time (RRT) of 490 compounds, and filtered the molecular descriptors with lasso regression and multiple linear regression analysis. Then ten important molecular descriptors were screened and applied the QSRR models with deep neural network (DNN), multiple linear regression (MLR), and support vector machine. The DNN model had the best accuracy which the correlation coefficient R2 reached 0.913. Finally, we determined the applicability of the DNN model through a descriptor value range to assist in the identification and screening of EPs.
Read full abstract