The application of AI to analytical and separative sciences is a recent challenge that offers new perspectives in terms of data prediction. In this work, we report an AI-based software, named Chrompredict 1.0, which based on chromatographic data of a novel mesogenic crown ether stationary phase (CESP). Its molecular design represents a significant advancement due to the unique combination of properties and binding capabilities, including the formation of a cavity, mesogenic behavior via mobile chains, and a range of polar and non-polar interactions (aromatic rings, N=N and O=O double bonds, alkyl chains, π–π interactions, and hydrogen bonding). The mesogenic phase is effective in both normal and reversed-phase chromatography, enhancing the software's adaptability across diverse datasets.Here we introduce for the first time an unprecedented scientific approach, integrating deep learning techniques with the novel CESP, which demonstrates exceptional thermal and analytical performance in both liquid chromatography modes, especially in the separation of complex hydrocarbon isomers. This ability enables the results obtained with CESP to extend across various types of stationary phases.Leveraging these insights, a comprehensive chromatographic dataset on a series of aromatic and polyaromatic molecules interacting with our CESP was used to train a Deep Learning Model (DLM). This model is embedded within a user-friendly software, Chrompredict 1.0, designed for predicting chromatographic parameters (MAE = 0.042, R² = 0.95) by selecting chemical descriptors directly from SMILES notation. It offers a deeper understanding of molecular structure and interactions through exploratory data analysis, identifying key factors affecting model accuracy and chromatographic behavior.Users can configure hyperparameters, choose from six machine learning models, and compare their performance with DLM. Chrompredict 1.0 excels in retention behavior prediction for compounds with known structures, and it accurately predicts chromatographic retention and thermal characteristics for different temperatures in HPLC and GC. The model has been successfully tested with METLIN database of 1,023 small molecules of diverse structures and polarities (R² > 0.75, error range ±7.8 s). Overall, the CESP, combined with Chrompredict 1.0, offers a robust tool for intelligent chromatographic analysis, encompassing chemo-informatics, statistical analysis, and graphical capabilities across a broad range of compounds and stationary phases.
Read full abstract