Abstract
Database-driven suspect screening has proven to be a useful tool to detect new psychoactive substances (NPS) outside the scope of targeted screening; however, the lack of retention times specific to a liquid chromatography (LC) system can result in a large number of false positives. A singular stream-lined, quantitative structure-retention relationship (QSRR)-based retention time prediction model integrating multiple LC systems with different elution conditions is presented using retention time data (n = 1281) from the online crowd-sourced database, HighResNPS. Modelling was performed using an artificial neural network (ANN), specifically a multi-layer perceptron (MLP), using four molecular descriptors and one-hot encoding of categorical labels. Evaluation of test set predictions (n = 193) yielded coefficient of determination (R2) and mean absolute error (MAE) values of 0.942 and 0.583 min, respectively. The model successfully differentiated between LC systems, predicting 54%, 81% and 97% of the test set within ±0.5, ±1 and ±2 min, respectively. Additionally, retention times for an analyte not previously observed by the model were predicted within ±1 min for each LC system. The developed model can be used to predict retention times for all analytes on HighResNPS for each participating laboratory's LC system to further support suspect screening.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have