Best-performing Model Research Articles

With the exponential progress in the field of cheminformatics, the conventional modeling approaches have so far been to employ supervised and unsupervised machine learning (ML) and deep learning models, utilizing the standard molecular descriptors, which represent the structural, physicochemical, and electronic properties of a particular compound. Deviating from the conventional approach, in this investigation, we have employed the classification Read-Across Structure-Activity Relationship (c-RASAR), which involves the amalgamation of the concepts of classification-based quantitative structure-activity relationship (QSAR) and Read-Across to incorporate Read-Across-derived similarity and error-based descriptors into a statistical and machine learning modeling framework. ML models developed from these RASAR descriptors use similarity-based information from the close source neighbors of a particular query compound. We have employed different classification modeling algorithms on the selected QSAR and RASAR descriptors to develop predictive models for efficient prediction of query compounds’ hepatotoxicity. The predictivity of each of these models was evaluated on a large number of test set compounds. The best-performing model was also used to screen a true external data set. The concepts of explainable AI (XAI) coupled with Read-Across were used to interpret the contributions of the RASAR descriptors in the best c-RASAR model and to explain the chemical diversity in the dataset. The application of various unsupervised dimensionality reduction techniques like t-SNE and UMAP and the supervised ARKA framework showed the usefulness of the RASAR descriptors over the selected QSAR descriptors in their ability to group similar compounds, enhancing the modelability of the dataset and efficiently identifying activity cliffs. Furthermore, the activity cliffs were also identified from Read-Across by observing the nature of compounds constituting the nearest neighbors for a particular query compound. On comparing our simple linear c-RASAR model with the previously reported models developed using the same dataset derived from the US FDA Orange Book (https://www.accessdata.fda.gov/scripts/cder/ob/index.cfm), it was observed that our model is simple, reproducible, transferable, and highly predictive. The performance of the LDA c-RASAR model on the true external set supersedes that of the previously reported work. Therefore, the present simple LDA c-RASAR model can efficiently be used to predict the hepatotoxicity of query chemicals.

Background: Although radiology reports are commonly used for lung cancer staging, this task can be challenging given radiologists' variable reporting styles as well as reports' potentially ambiguous and/or incomplete staging-related information. Objective: To compare performance of ChatGPT large-language models (LLMs) and human readers of varying experience in lung cancer staging using chest CT and FDG PET/CT free-text reports. Methods: This retrospective study included 700 patients (mean age, 73.8±29.5 years; 509 male, 191 female) from four institutions in Korea who underwent chest CT or FDG PET/CT for non-small cell lung cancer initial staging from January, 2020 to December, 2023. Examinations' reports used a free-text format, written exclusively in English or in mixed English and Korean. Two thoracic radiologists in consensus determined the overall stage group (IA, IB, IIA, IIB, IIIA, IIIB, IIIC, IVA, IVB) for each report using the AJCC 8th-edition staging system, establishing the reference standard. Three ChatGPT models (GPT-4o, GPT-4, GPT-3.5) determined an overall stage group for each report using a script-based application programming interface, zero-shot learning, and prompt incorporating a staging system summary. Six human readers (two fellowship-trained radiologists with lesser experience than the radiologists who determined the reference standard, two fellows, two residents) also independently determined overall stage groups. GPT-4o's overall accuracy for determining the correct stage among the nine groups was compared with that of the other LLMs and human readers using McNemar tests. Results: GPT-4o had an overall staging accuracy of 74.1%, significantly better than the accuracy of GPT-4 (70.1%, p=.02), GPT-3.5 (57.4%, p<.001), and resident 2 (65.7%, p<.001); significantly worse than the accuracy of fellowship-trained radiologist 1 (82.3%, p<.001) and fellowship-trained radiologist 2 (85.4%, p<.001); and not significantly different from the accuracy of fellow 1 (77.7%, p=.09), fellow 2 (75.6%, p=.53), and resident 1 (72.3%, p=.42). Conclusions: The best-performing model, GPT-4o, showed no significant difference in staging accuracy versus fellows, but significantly worse performance versus fellowship-trained radiologists. The findings do not support use of LLMs for lung cancer staging in place of expert healthcare professionals. Clinical Impact: The findings indicate the importance of domain expertise for performing complex specialized tasks such as cancer staging.

Best-performing Model Research Articles

Related Topics

Articles published on Best-performing Model

Attention-Driven Transfer Learning Model for Improved IoT Intrusion Detection

Comparative Analysis of Machine Learning Techniques for Water Consumption Prediction: A Case Study from Kocaeli Province.

Prediction of aeration performance of different types of piano key weirs using different machine learning models

Assessing plant pigmentation impacts: A novel approach integrating UAV and multispectral data to analyze atrazine metabolite effects from soil contamination

Development and calibration of a mathematical model of HIV outcomes among Rwandan adults: informing equitable achievement of targets in Rwanda.

Exoplanet Detection Using Machine Learning : A Comparative Study Using Kepler Mission Data

The application of chemical similarity measures in an unconventional modeling framework c-RASAR along with dimensionality reduction techniques to a representative hepatotoxicity dataset

Temporal trends in asteroid behaviour: a machine learning and N-body integration approach

Lung Cancer Staging Using Chest CT and FDG PET/CT Free-Text Reports: Comparison Among Three ChatGPT Large-Language Models and Six Human Readers of Varying Experience.

Development of a Diagnostic Model for Pancreatic Ductal Adenocarcinoma Using Machine Learning and Blood-Based miRNAs

A machine learning approach towards assessing consistency and reproducibility: an application to graft survival across three kidney transplantation eras.

Geospatial modeling of wildfire susceptibility on a national scale in Montenegro: A comparative evaluation of F-AHP and FR methodologies

DSEception: a noval neural networks architecture for enhancing pneumonia and tuberculosis diagnosis.

Industrial water withdrawal prediction using multi-head attention encoder model

Promoting Sustainable Development of Coal Mines: CNN Model Optimization for Identification of Microseismic Signals Induced by Hydraulic Fracturing in Coal Seams

Polygenic risk score portability for common diseases across genetically diverse populations

Effectiveness of South Africa's network of protected areas: Unassessed vascular plants predicted to be threatened using deep neural networks are all located in protected areas.

Integrating StEP-COMPAC Definition and Enhanced Recovery after Surgery Status in a Machine-learning-based Model for Postoperative Pulmonary Complications in Laparoscopic Hepatectomy

Data-Driven Cycle Life Prediction of Lithium Metal-Based Rechargeable Battery Based on Discharge/Charge Capacity and Relaxation Features.

Interpretable Machine Learning Model Based on Superb Microvascular Imaging for Non-Invasive Determination of Crescent Status of IgAN.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Best-performing Model Research Articles

Related Topics

Articles published on Best-performing Model

Attention-Driven Transfer Learning Model for Improved IoT Intrusion Detection

Comparative Analysis of Machine Learning Techniques for Water Consumption Prediction: A Case Study from Kocaeli Province.

Prediction of aeration performance of different types of piano key weirs using different machine learning models

Assessing plant pigmentation impacts: A novel approach integrating UAV and multispectral data to analyze atrazine metabolite effects from soil contamination

Development and calibration of a mathematical model of HIV outcomes among Rwandan adults: informing equitable achievement of targets in Rwanda.

Exoplanet Detection Using Machine Learning : A Comparative Study Using Kepler Mission Data

The application of chemical similarity measures in an unconventional modeling framework c-RASAR along with dimensionality reduction techniques to a representative hepatotoxicity dataset

Temporal trends in asteroid behaviour: a machine learning and N-body integration approach

Lung Cancer Staging Using Chest CT and FDG PET/CT Free-Text Reports: Comparison Among Three ChatGPT Large-Language Models and Six Human Readers of Varying Experience.

Development of a Diagnostic Model for Pancreatic Ductal Adenocarcinoma Using Machine Learning and Blood-Based miRNAs

A machine learning approach towards assessing consistency and reproducibility: an application to graft survival across three kidney transplantation eras.

Geospatial modeling of wildfire susceptibility on a national scale in Montenegro: A comparative evaluation of F-AHP and FR methodologies

DSEception: a noval neural networks architecture for enhancing pneumonia and tuberculosis diagnosis.

Industrial water withdrawal prediction using multi-head attention encoder model

Promoting Sustainable Development of Coal Mines: CNN Model Optimization for Identification of Microseismic Signals Induced by Hydraulic Fracturing in Coal Seams

Polygenic risk score portability for common diseases across genetically diverse populations

Effectiveness of South Africa's network of protected areas: Unassessed vascular plants predicted to be threatened using deep neural networks are all located in protected areas.

Integrating StEP-COMPAC Definition and Enhanced Recovery after Surgery Status in a Machine-learning-based Model for Postoperative Pulmonary Complications in Laparoscopic Hepatectomy

Data-Driven Cycle Life Prediction of Lithium Metal-Based Rechargeable Battery Based on Discharge/Charge Capacity and Relaxation Features.

Interpretable Machine Learning Model Based on Superb Microvascular Imaging for Non-Invasive Determination of Crescent Status of IgAN.