Precision And Recall Research Articles

Purpose: To evaluate the performance of a large language model (LLM) in classifying electronic health record (EHR) text, and to use this classification to evaluate the type and resolution of hemorrhagic events (HEs) following micro-invasive glaucoma surgery (MIGS).
Design: Retrospective cohort study.
Participants: Eyes from the Bascom Palmer Glaucoma Repository.
Methods: Eyes that underwent MIGS between July 1, 2014 and February 1, 2022 were analyzed. ChatGPT was used to classify deidentified EHR anterior chamber examination text into HE categories (no hyphema, microhyphema, clot, and hyphema). Agreement between classifications by ChatGPT and a glaucoma specialist was evaluated using Cohen’s kappa and the precision-recall (PR) curve. Time to resolution of HEs was assessed using Cox proportional-hazards models. Goniotomy HE resolution was evaluated by degree of angle treatment (90-179°, 180-269°, 270-360°). Logistic regression was used to identify HE risk factors.
Main Outcome Measures: Accuracy of ChatGPT HE classification, and incidence and resolution of HEs.
Results: The study included 434 goniotomy eyes (368 patients) and 528 Schlemm’s canal stent (SCS) eyes (390 patients). ChatGPT classified HEs with excellent agreement (Cohen’s kappa 0.93, area under the PR curve 0.968). Using ChatGPT classifications, at postoperative day 1, HEs occurred in 67.8% of goniotomy and 25.2% of SCS eyes (p<0.001). The 270-360° goniotomy group had the highest HE rate (84.0%, p<0.001). At postoperative week 1, HEs were observed in 43.4% of goniotomy and 11.3% of SCS eyes (p<0.001). By postoperative month 1, HE rates were 13.3% and 1.3% among goniotomy and SCS eyes, respectively (p<0.001). Time to HE resolution differed between the goniotomy angle groups (log-rank p=0.034); median time to resolution was 10, 10, and 15 days for the 90-179°, 180-269°, and 270-360° groups, respectively. Risk factor analysis showed that greater goniotomy angle was the only significant predictor of HEs (OR for 270-360°: 4.08, p<0.001).
Conclusions: LLMs can be used to classify longitudinal EHR free-text exam data with high accuracy, highlighting a promising direction for future LLM-assisted research and clinical decision support. HEs are relatively common, self-resolving complications that occur more often in goniotomy cases and with larger goniotomy treatments. Time to HE resolution differs significantly between goniotomy groups.
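As an illustration of the agreement statistic named in the Methods, the following is a minimal sketch of Cohen's kappa for two raters, for example an LLM and a specialist labeling the same exam notes. The function name `cohens_kappa` and the toy labels are assumptions made for this example and are not taken from the study.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two equal-length sequences of categorical labels."""
    if len(rater_a) != len(rater_b) or len(rater_a) == 0:
        raise ValueError("both raters must label the same non-empty set of items")
    n = len(rater_a)
    # Observed agreement: fraction of items given the same label by both raters.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: product of each rater's marginal label frequencies.
    count_a = Counter(rater_a)
    count_b = Counter(rater_b)
    expected = sum(count_a[label] * count_b[label] for label in count_a) / (n * n)
    if expected == 1.0:
        # Both raters used one identical label throughout; treat as full agreement.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical labels, not study data.
llm_labels = ["none", "none", "clot", "hyphema"]
specialist_labels = ["none", "clot", "clot", "hyphema"]
kappa = cohens_kappa(llm_labels, specialist_labels)
```

Kappa corrects raw agreement for the agreement expected by chance, so it is lower than simple accuracy whenever one category dominates the labels.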

Purpose: To develop and validate machine learning (ML) models to predict choroidal nevus transformation to melanoma based on multimodal imaging at initial presentation.
Design: Retrospective multicenter study.
Participants: Patients diagnosed with choroidal nevus on the Ocular Oncology Service at Wills Eye Hospital (2007-2017) or Mayo Clinic Rochester (2015-2023).
Methods: Multimodal imaging was obtained, including fundus photography, fundus autofluorescence, spectral domain OCT, and B-scan ultrasonography. ML models were created (XGBoost, LGBM, Random Forest, Extra Tree) and optimized for area under the receiver operating characteristic curve (AUROC). The Wills Eye Hospital cohort was used for training and testing (80% training, 20% testing) with 5-fold cross-validation. The Mayo Clinic cohort provided external validation. Model performance was characterized by AUROC and area under the precision-recall curve (AUPRC). Models were interrogated using SHapley Additive exPlanations (SHAP) to identify the features most predictive of conversion from nevus to melanoma. Differences in AUROC and AUPRC between models were tested using 10,000 bootstrap samples with replacement.
Main Outcome Measures: AUROC and AUPRC for each ML model.
Results: There were 2,870 nevi included in the study, with conversion to melanoma confirmed in 128 cases. Simple AI Nevus Transformation System (SAINTS; XGBoost) was the top-performing model in the test cohort [pooled AUROC 0.864 (95% confidence interval (CI): 0.864-0.865), pooled AUPRC 0.244 (0.243-0.246)] and in the external validation cohort [pooled AUROC 0.931 (95% CI: 0.930-0.931), pooled AUPRC 0.533 (0.531-0.535)]. Other models also had good discriminative performance: LGBM (test set pooled AUROC 0.831, validation set pooled AUROC 0.815), Random Forest (test set pooled AUROC 0.812, validation set pooled AUROC 0.866), and Extra Tree (test set pooled AUROC 0.826, validation set pooled AUROC 0.915). A model including only nevi with at least 5 years of follow-up demonstrated the best AUPRC performance (test: pooled 0.592 (95% CI: 0.590-0.594); validation: pooled 0.656 (95% CI: 0.655-0.657)). The top five features in SAINTS by SHAP values were tumor thickness, largest tumor basal diameter, tumor shape, distance to optic nerve, and subretinal fluid extent.
Conclusions: We demonstrate the accuracy and generalizability of an ML model for predicting choroidal nevus transformation to melanoma based on multimodal imaging.
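To make the two reported metrics concrete, here is a minimal pure-Python sketch of AUROC (computed as the fraction of positive-negative pairs ranked correctly) and of average precision, a common step-wise estimate of AUPRC. The function names, the toy labels, and the toy scores are assumptions for this example; the study's own evaluation code is not shown in the abstract, and ties in scores are handled only in the simplest way.

```python
def auroc(labels, scores):
    """AUROC as the probability that a positive outranks a negative (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def average_precision(labels, scores):
    """Mean of the precision values measured at the rank of each positive example."""
    n_pos = sum(labels)
    if n_pos == 0:
        raise ValueError("need at least one positive example")
    # Rank examples from highest to lowest score.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    hits = 0
    total = 0.0
    for rank, (_, y) in enumerate(ranked, start=1):
        if y == 1:
            hits += 1
            total += hits / rank  # precision among the top `rank` examples
    return total / n_pos


# Hypothetical labels (1 = converted to melanoma) and model scores, not study data.
toy_labels = [1, 0, 1, 0, 0]
toy_scores = [0.9, 0.8, 0.7, 0.3, 0.1]
toy_auroc = auroc(toy_labels, toy_scores)
toy_ap = average_precision(toy_labels, toy_scores)
```

A random ranking is expected to score an AUROC near 0.5 but an AUPRC near the positive-class prevalence, which is about 4.5% overall here (128 of 2,870 nevi). This is why an AUPRC that looks modest in absolute terms can still sit well above chance when the outcome is rare.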

Articles published on Precision And Recall

Improving Uncertainty-Error Correspondence in Deep Bayesian Medical Image Segmentation

Deep-learning-based method for the segmentation of ureter and renal pelvis on non-enhanced CT scans

Fund transfer fraud detection: Analyzing irregular transactions and customer relationships with self-attention and graph neural networks

Optimizing machine learning algorithms for diabetes data: A metaheuristic approach to balancing and tuning classifiers parameters

Development and Validation of Machine Learning Algorithms for Prediction of Colorectal Polyps Based on Electronic Health Records.

The Completeness of Accreting Neutron Star Binary Candidates from the Chinese Space Station Telescope

ChatGPT-Assisted Classification of Postoperative Bleeding Following Microinvasive Glaucoma Surgery Using Electronic Health Record Data

Enhancing corporate bankruptcy prediction via a hybrid genetic algorithm and domain adaptation learning architecture

English grammar intelligent error correction technology based on the n-gram language model

Prediction of vasopressor needs in hypotensive emergency department patients using serial arterial blood pressure data with deep learning

Predicting postoperative delirium assessed by the Nursing Screening Delirium Scale in the recovery room for non-cardiac surgeries without craniotomy: A retrospective study using a machine learning approach.

A foundational transformer leveraging full night, multichannel sleep study data accurately classifies sleep stages.

Ensemble learning techniques against structured query language injection attacks

Deep Learning-Based Electrocardiogram Analysis Predicts Biventricular Dysfunction and Dilation in Congenital Heart Disease

High Prevalence of Artifacts in Optical Coherence Tomography With Adequate Signal Strength.

Interpretable machine learning models for predicting clinical pregnancies associated with surgical sperm retrieval from testes of different etiologies: a retrospective study

Machine-learning-based prediction of disability progression in multiple sclerosis: An observational, international, multi-center study.

LSGDDN-LCD: An appearance-based loop closure detection using local superpixel grid descriptors and incremental dynamic nodes

Crowd Density Level Classification for Service Waiting Room Based on Head Detection to Enhance Visitor Experience

Predicting Choroidal Nevus Transformation to Melanoma Using Machine Learning
