Abstract

Problem: The 21st century has produced an eruption of research using AI for the detection and diagnosis of cancer. Yet an often-unspoken core premise in this field of computational pathology is that a glass slide suitably represents the patient's disease. Here, we report that systematic confounds may dominate the slides from a medical center, such that the slides are unsuitable for diagnosis.

Methods: We mathematically define high-quality data as a whole slide image set in which the patient's surgery may be accurately predicted by an automated system. Our system, "iQC", accurately distinguished biopsies (i.e., thin strands of tissue) from nonbiopsies, e.g., transurethral resections of the prostate (TURPs) or prostatectomies, only when the data appeared to be of high quality, e.g., bright histopathology stains and few artifacts. Thus, when the data are of high quality, iQC (i) accurately classifies pixels as tissue, (ii) accurately generates statistics that describe the distribution of tissue, and (iii) accurately predicts the surgical procedure from those statistics. We compare iQC against the published HistoQC tool.

Results: iQC holds all data to the same objective quality standard. We validate this standard at five Veterans Affairs Medical Centers (VAMCs) and on the public Automated Gleason Grading Challenge (AGGC) dataset. For the surgery prediction task, we report AUROCs of 0.9966-1.000 at the VAMCs that produced high-quality data and an AUROC of 0.9824 for AGGC. In contrast, we report an AUROC of 0.7115 at the VAMC that produced poor-quality data. A pathologist found that the poor quality may be explained by faded histopathology stains and VAMC protocol differences. Supporting this, iQC's novel stain strength statistic finds that this VAMC had weaker stains (p < 2.2e-16, two-tailed Wilcoxon rank-sum test; Cohen's d = 1.208) than the VAMC that contributed most of the slides. Additionally, iQC recommended only 2 of 3,736 (0.05%) VAMC slides for review due to inadequate tissue.
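The two summary statistics above, the surgery-prediction AUROC and the effect size of the stain-strength comparison, both reduce to simple rank and mean computations. The following is a minimal, illustrative Python sketch of those two computations only; it is not the iQC implementation, and the input lists are hypothetical placeholders, not VAMC data.

```python
# Illustrative sketch only -- not the authors' iQC code.
# auroc() uses the Mann-Whitney identity: AUROC equals the probability that
# a randomly chosen positive outscores a randomly chosen negative.
# cohens_d() is the standard pooled-standard-deviation effect size.
from statistics import mean, stdev

def auroc(pos_scores, neg_scores):
    """AUROC = U / (n_pos * n_neg); ties count as half a win."""
    wins = sum((p > n) + 0.5 * (p == n)
               for p in pos_scores for n in neg_scores)
    return wins / (len(pos_scores) * len(neg_scores))

def cohens_d(group_a, group_b):
    """Cohen's d with a pooled standard deviation."""
    na, nb = len(group_a), len(group_b)
    pooled_sd = (((na - 1) * stdev(group_a) ** 2
                  + (nb - 1) * stdev(group_b) ** 2)
                 / (na + nb - 2)) ** 0.5
    return (mean(group_a) - mean(group_b)) / pooled_sd

# Hypothetical scores: classifier outputs for biopsy vs. nonbiopsy slides.
print(auroc([0.9, 0.8, 0.7], [0.1, 0.2, 0.3]))  # perfectly separable -> 1.0
# Hypothetical stain-strength values for two sites.
print(cohens_d([2.0, 3.0, 4.0], [0.0, 1.0, 2.0]))  # -> 2.0
```

A p-value for the stain comparison would come from the Wilcoxon rank-sum test itself (e.g., `scipy.stats.ranksums` in practice); the sketch stays stdlib-only for brevity.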
In contrast, HistoQC in its default configuration excluded 89.9% of VAMC slides because tissue was not detected; we reduced this to 16.7% with our custom HistoQC configuration.

Conclusion: Our surgery-prediction AUROC may serve as a quantitative indicator positively associated with a dataset's quality. Unless the data are of poor quality, iQC accurately locates tissue in slides and excludes few slides. iQC is, to our knowledge, the first automated system in computational pathology that validates quality against objective evidence, e.g., surgical procedure data available in the EHR/LIMS, and it requires no effort or annotations from anatomic pathologists.

Citation Format: Andrew J. Schaumberg, Michael S. Lewis, Ramin Nazarian, Ananta Wadhwa, Nathanael Kane, Graham Turner, Purushotham Karnam, Poornima Devineni, Nicholas Wolfe, Randall Kintner, Matthew B. Rettig, Beatrice S. Knudsen, Isla P. Garraway, Saiju Pyarajan. iQC: machine-learning-driven prediction of surgery reveals systematic confounds in cancer whole slide images from hospitals by protocol [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2024; Part 1 (Regular Abstracts); 2024 Apr 5-10; San Diego, CA. Philadelphia (PA): AACR; Cancer Res 2024;84(6_Suppl):Abstract nr 3511.