Image Tiles Research Articles

Introduction: Overlapping clinical, molecular, and histopathological characteristics make it difficult to differentiate prefibrotic primary myelofibrosis (prePMF) from essential thrombocythemia (ET). Median overall survival, however, differs significantly between prePMF and ET (11.9 vs 22.2 years; Jeryczynski, 2017). This difference in survival highlights the need to distinguish between these two myeloproliferative neoplasms (MPNs) in order to select disease-specific therapeutic options. A definitive diagnosis often requires expert assessment at high-volume academic institutions, which leaves an unmet need. Our aim in this study is to develop and validate a biologically motivated AI algorithm to rapidly, accurately, and inexpensively diagnose prePMF and ET directly from diagnostic bone marrow (BM) biopsy digital whole-slide images (WSI).

Methods: Patients with a clinical/histopathological diagnosis of prePMF or ET, as determined by the International Consensus Classification of Myeloid Neoplasms, were identified at the University of Florence, Italy (Florence) between 06/2007 and 05/2023 and at Moffitt Cancer Center, Tampa, FL (Moffitt) between 01/2013 and 01/2022. Diagnostic H&E-stained BM biopsy slides were digitized using Aperio AT2 slide scanners (Leica Biosystems, Deer Park, IL) at each institution. The training cohort comprised 200 patients (100 prePMF / 100 ET) from Florence, and the external test cohort comprised 26 patients (6 prePMF / 20 ET) from Moffitt. In total, the resultant model was trained on 32,226 patient-derived WSI. Our chosen pretrained neural network, RetCCL, was previously trained on 32,000 diagnostic WSIs to potentially represent a histologically informed model (Wang, 2023). BM WSI were tessellated into representative image tiles extracted at 10x magnification (302 microns per image dimension) for model training. Finally, a prediction for each patient's WSI was calculated by attention-based multiple instance learning, a method that automatically assigns each image portion a numeric weight representing its relative importance to the classification task. Model performance was assessed using the area under the receiver operating characteristic curve (AUC). The cutoff threshold for diagnostic classification was determined by maximizing Youden's Index. For qualitative assessment, attention scores were plotted as a heatmap across the BM WSI and reviewed for morphological features by an expert hematopathologist. Custom scripts were written using our open-source AI framework, Slideflow (Dolezal, 2021). Model development was performed on the Minerva High Performance Computer at Mount Sinai Hospital. Evaluation time for a single WSI was estimated using a consumer-grade computer with an NVIDIA RTX 3080 graphics processing unit.

Results: Within the training cohort, 5-fold cross-validation resulted in a mean AUC of 0.90 with a standard deviation of 0.04. A final locked model retrained on the entire training cohort achieved an AUC of 0.90 on the test cohort (Figure 1). We optimized the classification threshold to balance sensitivity and specificity; the final diagnostic classification accuracy on the test cohort was 92.3%, with a sensitivity and specificity for prePMF diagnosis of 66.6% and 100%, respectively. Upon review of the slides with the highest prediction value per class, attention heatmaps showed that the model relied on areas of cellular marrow rather than on image artifacts or background (Figure 2). Using affordable consumer-grade hardware, evaluation of a previously unseen WSI was completed in approximately 6.1 seconds (4.9 for preprocessing and 1.2 for evaluation).

Conclusion: We developed a novel AI model with high accuracy for distinguishing between prePMF and ET in distinct clinical cohorts. To our knowledge, this study represents the largest image-based AI study within MPNs with external validation. Our proposed model may assist clinicians in appropriately identifying patient cohorts who would benefit from disease-specific therapies or enrollment in clinical trials. We envision that a high-speed, low-cost algorithm may reliably distinguish prePMF from ET with high specificity, could be made widely available to the MPN clinical community in routine practice, and could drive clinical trial accrual for biologically rational novel therapeutics.
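The abstract above states that the diagnostic cutoff was chosen by maximizing Youden's Index, J = sensitivity + specificity - 1. The following is a minimal sketch of that selection rule in plain Python, not the authors' Slideflow code; the function name `youden_threshold`, the convention that a case is called positive when its score is at or above the threshold, and the toy scores in the usage note are all assumptions made for illustration.

```python
def youden_threshold(scores, labels):
    """Return (threshold, J) maximizing J = sensitivity + specificity - 1.

    scores: predicted probability of the positive class, one per patient
    labels: 1 for the positive class (e.g. prePMF), 0 for the negative (e.g. ET)
    Assumption: a case is called positive when score >= threshold.
    """
    positives = sum(labels)
    negatives = len(labels) - positives
    if positives == 0 or negatives == 0:
        raise ValueError("need at least one positive and one negative case")

    best_threshold, best_j = None, -1.0
    # Every distinct observed score is a candidate cutoff.
    for threshold in sorted(set(scores)):
        tp = sum(1 for s, y in zip(scores, labels) if y == 1 and s >= threshold)
        tn = sum(1 for s, y in zip(scores, labels) if y == 0 and s < threshold)
        j = tp / positives + tn / negatives - 1.0
        if j > best_j:  # ties keep the lowest threshold
            best_threshold, best_j = threshold, j
    return best_threshold, best_j
```

On hypothetical scores `[0.1, 0.2, 0.3, 0.6, 0.7, 0.9]` with labels `[0, 0, 0, 1, 1, 1]`, the classes separate perfectly and the function selects the lowest positive score as the cutoff. Because J weights sensitivity and specificity equally, a cutoff chosen this way on a training cohort can still yield unequal sensitivity and specificity on a small external cohort.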

Histopathological examination is a crucial step in the diagnosis and treatment of many major diseases. Aiming to facilitate diagnostic decision making and reduce the workload of pathologists, we developed an artificial intelligence (AI)-based prescreening tool that analyses whole-slide images (WSIs) of large-bowel biopsies to identify typical, non-neoplastic, and neoplastic biopsies.

This retrospective cohort study was conducted with an internal development cohort of slides acquired from a hospital in the UK and three external validation cohorts of WSIs acquired from two hospitals in the UK and one clinical laboratory in Portugal. To learn the differential histological patterns from digitised WSIs of large-bowel biopsy slides, our proposed weakly supervised deep-learning model (Colorectal AI Model for Abnormality Detection [CAIMAN]) used slide-level diagnostic labels and no detailed cell-level or region-level annotations. The method was developed with an internal development cohort of 5054 biopsy slides from 2080 patients that were labelled with corresponding diagnostic categories assigned by pathologists. The three external validation cohorts, with a total of 1536 slides, were used for independent validation of CAIMAN. Each WSI was classified into one of three classes (ie, typical, atypical non-neoplastic, and atypical neoplastic). Prediction scores of image tiles were aggregated into three prediction scores for the whole slide: one for its likelihood of being typical, one for its likelihood of being non-neoplastic, and one for its likelihood of being neoplastic. The assessment of the external validation cohorts was conducted by the trained and frozen CAIMAN model. To evaluate model performance, we calculated area under the convex hull of the receiver operating characteristic curve (AUROC), area under the precision-recall curve, and specificity compared with our previously published iterative draw and rank sampling (IDaRS) algorithm. We also generated heat maps and saliency maps to analyse and visualise the relationship between the WSI diagnostic labels and spatial features of the tissue microenvironment. The main outcome of this study was the ability of CAIMAN to accurately identify typical and atypical WSIs of colon biopsies, which could potentially facilitate automatic removal of typical biopsies from the diagnostic workload in clinics.

A randomly selected subset of all large-bowel biopsies was obtained between Jan 1, 2012, and Dec 31, 2017. The AI training, validation, and assessments were done between Jan 1, 2021, and Sept 30, 2022. WSIs with diagnostic labels were collected between Jan 1 and Sept 30, 2022. Our analysis showed no statistically significant differences across prediction scores from CAIMAN for typical and atypical classes based on anatomical sites of the biopsy. At 0·99 sensitivity, CAIMAN (specificity 0·5592) was more accurate than an IDaRS-based weakly supervised WSI-classification pipeline (0·4629) in identifying typical and atypical biopsies on cross-validation in the internal development cohort (p<0·0001). At 0·99 sensitivity, CAIMAN was also more accurate than IDaRS for two external validation cohorts (p<0·0001), but not for a third external validation cohort (p=0·10). CAIMAN provided higher specificity than IDaRS at some high-sensitivity thresholds (0·7763 vs 0·6222 for 0·95 sensitivity, 0·7126 vs 0·5407 for 0·97 sensitivity, and 0·5615 vs 0·3970 for 0·99 sensitivity on one of the external validation cohorts) and showed high classification performance in distinguishing between neoplastic biopsies (AUROC 0·9928, 95% CI 0·9927-0·9929), inflammatory biopsies (0·9658, 0·9655-0·9661), and atypical biopsies (0·9789, 0·9786-0·9792). On the three external validation cohorts, CAIMAN had AUROC values of 0·9431 (95% CI 0·9165-0·9697), 0·9576 (0·9568-0·9584), and 0·9636 (0·9615-0·9657) for the detection of atypical biopsies. Saliency maps supported the representation of disease heterogeneity in model predictions and its association with relevant histological features.

CAIMAN, with its high sensitivity in detecting atypical large-bowel biopsies, might be a promising improvement in clinical workflow efficiency and diagnostic decision making in prescreening of typical colorectal biopsies.

Funding: The Pathology Image Data Lake for Analytics, Knowledge and Education Centre of Excellence; the UK Government's Industrial Strategy Challenge Fund; and Innovate UK on behalf of UK Research and Innovation.
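The comparison above reports specificity at fixed high sensitivities (0·95, 0·97, 0·99). As a rough sketch of how such an operating point can be read off slide-level scores, the plain-Python function below scans candidate cutoffs and keeps the one with the highest specificity among those that meet the target sensitivity. It is not the CAIMAN or IDaRS code; the function name `specificity_at_sensitivity`, the convention that a slide is called atypical when its score is at or above the cutoff, and the toy data in the usage note are assumptions for illustration.

```python
def specificity_at_sensitivity(scores, labels, target_sensitivity):
    """Return (threshold, specificity) with the highest specificity among
    thresholds whose sensitivity is at least target_sensitivity.

    scores: predicted probability that a slide is atypical
    labels: 1 for atypical (positive), 0 for typical (negative)
    Assumption: a slide is called atypical when score >= threshold.
    Returns None if no threshold reaches the target (e.g. target > 1).
    """
    positives = sum(labels)
    negatives = len(labels) - positives
    if positives == 0 or negatives == 0:
        raise ValueError("need at least one positive and one negative slide")

    best = None
    for threshold in sorted(set(scores)):
        tp = sum(1 for s, y in zip(scores, labels) if y == 1 and s >= threshold)
        if tp / positives < target_sensitivity:
            continue  # this cutoff misses too many atypical slides
        tn = sum(1 for s, y in zip(scores, labels) if y == 0 and s < threshold)
        specificity = tn / negatives
        if best is None or specificity > best[1]:
            best = (threshold, specificity)
    return best
```

With hypothetical scores `[0.05, 0.2, 0.4, 0.3, 0.8, 0.9]` and labels `[0, 0, 0, 1, 1, 1]`, a 0.99 sensitivity target forces the cutoff down to the lowest atypical score, so one typical slide is still flagged. In a prescreening setting, specificity at the chosen operating point is the fraction of typical slides that would be correctly set aside without review.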

Articles published on Image Tiles

ScabyNet, a user-friendly application for detecting common scab in potato tubers using deep learning and morphological traits

PathEX: Make good choice for whole slide image extraction.

Neighborhood attention transformer multiple instance learning for whole slide image classification.

Single-cell Heterogeneity-aware Transformer-guided Multiple Instance Learning for Cancer Aneuploidy Prediction from Whole Slide Histopathology Images.

Weakly Supervised Classification for Nasopharyngeal Carcinoma with Transformer in Whole Slide Images.

Classifier-guided multi-style tile image generation method

Msemalign: a pipeline for serial section multibeam scanning electron microscopy volume alignment.

Development of Automated Risk Stratification for Sporadic Odontogenic Keratocyst Whole Slide Images with an Attention-Based Image Sequence Analyzer

Assessment of the large-scale extraction of photovoltaic (PV) panels with a workflow based on artificial neural networks and algorithmic postprocessing of vectorization results

Interpretable Artificial Intelligence (AI) Differentiates Prefibrotic Primary Myelofibrosis (prePMF) from Essential Thrombocythemia (ET): A Multi-Center Study of a New Clinical Decision Support Tool

Automatic Characterization of Boulders on Planetary Surfaces From High‐Resolution Satellite Images

Development and validation of artificial intelligence-based prescreening of large-bowel biopsies taken in the UK and Portugal: a retrospective cohort study

Manifold Explorer: Satellite Image Labelling and Clustering Tool with Using Deep Convolutional Autoencoders

A Federated Learning Approach to Tumor Detection in Colon Histology Images.

Deep Learning–Enabled Diagnosis of Liver Adenocarcinoma

Synthetic whole-slide image tile generation with gene expression profile-infused deep generative models

Region of interest (ROI) selection using vision transformer for automatic analysis using whole slide images

Computational textural mapping harmonises sampling variation and reveals multidimensional histopathological fingerprints

Inconsistency Detection in Cross-Layer Tile Maps with Super-Pixel Segmentation

Efficient Management and Scheduling of Massive Remote Sensing Image Datasets
