The rapid expansion of scientific literature necessitates developing efficient data extraction and analysis methods. This study presents an innovative approach to automating the extraction of cryoprotectant information from scientific publications using a generative pretrained transformer (GPT) model integrated with a Telegram bot interface. Our system processes and analyzes scientific articles to identify and extract relevant data on cryoprotectants and bacteria, significantly reducing the time required for researchers to gather essential information. Our method optimizes the workflow for researchers in cryopreservation and related fields by utilizing modern artificial intelligence technologies, specifically large language models. The Telegram bot, designed to be user-friendly, provides a comfortable and easy platform for quick data access, enhancing scientific research efficiency. The study's methodology involves data preparation, algorithm development, and system validation using a substantial data set of scientific articles. Results demonstrate the model's capability to accurately recognize and extract critical information, although some limitations in term specificity were noted. Our findings suggest that further refinement and training of the model can enhance its accuracy and reliability for specialized scientific applications.
Read full abstract