Accuracy Metrics Research Articles

Abstract

Background: Atherosclerosis is the main underlying cause of cardiovascular disease (CVD). Existing CVD risk assessment tools do not consider the burden of subclinical atherosclerosis. The presence of carotid plaques on carotid ultrasound is a well-known marker of subclinical atherosclerosis. The accumulation of population-scale data on the presence of atherosclerotic plaques, along with deep phenotyping, makes it possible not only to assess the effectiveness of carotid ultrasound in routine clinical practice but also to shed light on the biology of atherosclerosis development.

Purpose: To develop an effective deep learning model for plaque detection in carotid ultrasound images in the UK Biobank.

Methods: We used 680 carotid ultrasound images with manually annotated plaques to train a deep learning model employing the YOLOv8 architecture. Different augmentation techniques were used to increase the generalizability of the model. The developed model was applied to automatically detect plaques in raw ultrasound images from 19,507 UK Biobank participants. Logistic and Cox regression were used to explore the associations of plaque presence and number, as predicted by the model, with conventional CVD risk factors and the risk of future CVD events over follow-up. To explore the genetic architecture of subclinical atherosclerosis, we conducted a genome-wide association study (GWAS) on plaque presence, followed by meta-analysis with data from the CHARGE Consortium.

Results: Our plaque detection model achieved high accuracy, sensitivity, and specificity (89.3%, 89.5%, and 89.2%, respectively) and detected atherosclerotic plaques in 44% of UK Biobank participants. As expected, plaques were more common among men than women, and their prevalence increased linearly with age. Both plaque presence and number of plaques were correlated with conventional CVD risk factors, including diabetes, hypertension, and hyperlipidemia, and showed strong associations with future risk of incident CVD events (hazard ratio for plaque presence: 1.48 [95% CI: 1.21-1.82]; for 2 plaques or more: 1.65 [95% CI: 1.28-2.13]). Incorporating plaque-derived phenotypes minimally altered the C-index of the time-to-event model. GWAS meta-analysis of carotid plaque presence revealed 5 previously known loci, as well as a significant locus including the LPA gene that had not previously been associated with carotid plaque.

Conclusion: We have developed an efficient plaque detection model and applied it to data from the UK Biobank. The approach holds significant promise for studying atherosclerosis at a population-wide scale through integration with multiomics data and electronic health records.
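The three classification metrics quoted in the Results can be illustrated with a minimal sketch. It assumes binary per-image labels (1 = plaque present, 0 = no plaque); the function names and the toy labels below are hypothetical and are not taken from the study, whose own evaluation code is not shown here.

```python
import math


def safe_ratio(numerator, denominator):
    """Return numerator / denominator, or NaN when the denominator is zero."""
    return numerator / denominator if denominator else math.nan


def confusion_counts(y_true, y_pred):
    """Count TP, FP, TN, FN for binary labels (1 = plaque, 0 = no plaque)."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return tp, fp, tn, fn


def classification_metrics(y_true, y_pred):
    """Compute accuracy, sensitivity (recall) and specificity."""
    tp, fp, tn, fn = confusion_counts(y_true, y_pred)
    return {
        "accuracy": safe_ratio(tp + tn, tp + fp + tn + fn),
        "sensitivity": safe_ratio(tp, tp + fn),  # true positive rate
        "specificity": safe_ratio(tn, tn + fp),  # true negative rate
    }


# Hypothetical toy data: 4 images with plaque, 6 without.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]
metrics = classification_metrics(y_true, y_pred)
print(metrics)
```

Reporting sensitivity and specificity alongside accuracy matters when classes are imbalanced, because accuracy alone can look high even if one class is detected poorly.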

Several studies report the benefits and accuracy of using autosegmentation for organ at risk (OAR) outlining in radiotherapy treatment planning. Typically, evaluations focus on accuracy metrics, and other parameters such as perceived utility and safety are routinely ignored. Here, we report our findings from the implementation and clinical evaluation of OSAIRIS, an open-source AI model for radiotherapy image segmentation, which was carried out as part of its development into a medical device. The device contours OARs in the head and neck and male pelvis (referred to as the prostate model), and is designed to be used as a time-saving workflow device, alongside a clinician. Unlike standard evaluation processes, which rely heavily on accuracy metrics alone, our evaluation sought to demonstrate the tangible benefits, quantify utility and assess risk within a specific clinical workflow. We evaluated the time-saving benefit this device affords to clinicians, and how this time-saving might be linked to accuracy metrics, as well as the clinicians' assessment of the usability of the OSAIRIS contours in comparison to their colleagues' contours and those from other commercial AI contouring devices. Our safety evaluation focused on whether clinicians can notice and correct any errors should they be included in the output of the device.

We found that OSAIRIS affords a significant time-saving of 36% (5.4 ± 2.1 minutes) when used for prostate contouring and 67% (30.3 ± 8.7 minutes) for head and neck contouring. Combining editing-time data with accuracy metrics, we found that the Hausdorff distance correlated best with editing time, outperforming Dice, the industry standard, with a Spearman correlation coefficient of 0.70 and a Kendall coefficient of 0.52. Our safety and risk-mitigation exercise showed that anchoring bias is present when clinicians edit AI-generated contours, with the effect seemingly more pronounced for some structures than for others. Most errors, however, were corrected by clinicians, with 72% of the head and neck errors and 81% of the prostate errors removed in the editing step. Notably, our blinded clinician contour rating exercise showed that gold standard clinician contours are not rated more highly than the AI-generated contours.

We conclude that evaluations of AI in a clinical setting must consider the clinical workflow in which the device will be used, and not rely on accuracy metrics alone, in order to reliably assess the benefits, utility and safety of the device. The effects of human-AI inter-operation must be evaluated to accurately assess the practical usability and potential uptake of the technology, as demonstrated in our blinded clinical utility review. The clinical risks posed by the use of the device must be studied and mitigated as far as possible, and our ‘Mystery Shopping’ experiment provides a template for future such assessments.
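To make the metrics named in this abstract concrete, the sketch below computes a Dice coefficient and a symmetric Hausdorff distance between two contours, then a Spearman rank correlation between a metric and editing time. It is illustrative only: contours are assumed to be given as sets of 2D pixel coordinates, all data values are hypothetical, and the exact metric variants used in the OSAIRIS evaluation (for example, a percentile Hausdorff distance or 3D surfaces) are not stated in the abstract.

```python
import math


def dice_coefficient(a, b):
    """Dice overlap 2|A∩B| / (|A| + |B|) between two sets of pixel coordinates."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0  # two empty contours are treated as identical
    return 2 * len(a & b) / (len(a) + len(b))


def directed_hausdorff(a, b):
    """Largest distance from any point in a to its nearest point in b."""
    return max(min(math.dist(p, q) for q in b) for p in a)


def hausdorff_distance(a, b):
    """Symmetric Hausdorff distance between two non-empty point sets."""
    a, b = list(a), list(b)
    if not a or not b:
        raise ValueError("both point sets must be non-empty")
    return max(directed_hausdorff(a, b), directed_hausdorff(b, a))


def average_ranks(values):
    """Rank values from 1..n, giving tied values their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((u - mx) * (v - my) for u, v in zip(x, y))
    sx = math.sqrt(sum((u - mx) ** 2 for u in x))
    sy = math.sqrt(sum((v - my) ** 2 for v in y))
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical contours: two 2x2 pixel blocks that overlap in one column.
ai_contour = {(0, 0), (0, 1), (1, 0), (1, 1)}
clinician_contour = {(0, 1), (1, 1), (0, 2), (1, 2)}
dice = dice_coefficient(ai_contour, clinician_contour)
hausdorff = hausdorff_distance(ai_contour, clinician_contour)

# Hypothetical per-case Hausdorff distances and editing times in minutes.
hausdorff_per_case = [1.0, 2.5, 4.0, 3.0]
editing_minutes = [2.0, 3.5, 6.0, 7.0]
rho = spearman(hausdorff_per_case, editing_minutes)
print(dice, hausdorff, rho)
```

Dice summarises overall overlap, while the Hausdorff distance is driven by the single worst boundary disagreement, which is one plausible reason a distance-based metric could track manual editing effort more closely.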

Articles published on Accuracy Metrics

Machine learning based system for early heart disease detection and classification using audio signal processing approach

Advanced Image Classification Using Convolutional Neural Networks

Deep learning-based algorithm for staging secondary caries in bitewings.

Assessment of Artificial Intelligence Chatbot Responses to Common Patient Questions on Bone Sarcoma.

Using deep learning to detect atherosclerotic plaques on carotid ultrasound images in the UK Biobank

Evaluating Machine Learning Models for Stroke Prognosis and Prediction in Atrial Fibrillation Patients: A Comprehensive Meta-Analysis

Generation Method for HVAC Systems Design Schemes in Office Buildings Based on Deep Graph Generative Models

Visible feature engineering to detect fraud in black and red peppers

Oral screening of dental calculus, gingivitis and dental caries through segmentation on intraoral photographic images using deep learning

Implementation of Chaotic Neural Key Generation Algorithm For IoT Devices

Machine learning for pacemaker implantation prediction after TAVI using multimodal imaging data

A novel approach to detecting epileptic patients: complex network-based EEG classification

ViT-HHO: Optimized vision transformer for diabetic retinopathy detection using Harris Hawk optimization

An Echo State Network-Based Light Framework for Online Anomaly Detection: An Approach to Using AI at the Edge

A Labeling Intercomparison of Retrogressive Thaw Slumps by a Diverse Group of Domain Experts

Analyzing Unimproved Drinking Water Sources and Their Determinants Using Supervised Machine Learning: Evidence from the Somaliland Demographic Health Survey 2020

OSAIRIS: Lessons Learned From the Hospital-Based Implementation and Evaluation of an Open-Source Deep-Learning Model for Radiotherapy Image Segmentation

Fine-Tuned Bidirectional Encoder Representations From Transformers Versus ChatGPT for Text-Based Outpatient Department Recommendation: Comparative Study.

Experience in using artificial intelligence services for diagnosing compression fracture of vertebral body based on computed tomography – from testing to trials

Variational benchmarks for quantum many-body problems.
