Multi-instance Learning Research Articles

Abstract Current flow cytometric analysis of blood and bone marrow samples for the diagnosis of acute leukemias relies heavily on manual intervention in both the processing and analysis steps, introducing significant subjectivity into the resulting diagnosis and increasing diagnostic turn-around time. Additionally, concurrent molecular characterization of these samples via cytogenetics and targeted sequencing panels can take multiple days, thereby delaying patient diagnosis and treatment. Attention-based multi-instance learning models are machine learning models that can make accurate predictions and generate interpretable insights regarding the classification of a sample from multiple events/cells; while these models have been developed for anatomic pathology applications, they have yet to be applied to flow cytometry data. By utilizing 1,820 flow cytometry samples from 2019-2022 at Brigham and Women’s Hospital, we developed attention-based multi-instance machine learning models for automated diagnosis of acute leukemia, including differentiation of acute myeloid leukemia (AML) from B-lymphoblastic leukemia/lymphoma (B-ALL). Additionally, using concurrent cytogenetic and targeted sequencing data from 674 acute leukemia samples, machine learning models for prediction of molecular aberrancies from flow cytometry data were developed. Machine learning models were created using the TabNet deep learning architecture and VIME self-supervised training algorithm, which are state-of-the-art approaches towards machine learning from tabular data including flow cytometry data. Attention-based multi-instance models demonstrated strong performance for the automated diagnosis of acute leukemia versus non-leukemia samples (AUROC 0.869), as well as the separation of AML from B-ALL samples (AUROC 0.971). These models also accurately predicted cytogenetic aberrancies among AML samples, including t(15;17);PML::RARA (AUROC 1.00), as well as mutations including NPM1 (AUROC 0.725). These models do not require any manual intervention including compensation or gating, and additionally provide quantitative scores for the relative importance of different flow cytometry events and markers for the diagnosis of a particular sample. These importance scores can be integrated into flow cytometry analysis software for visualization and interpretation by hematopathologists. In this study, we have demonstrated the capability of machine learning models to provide automated diagnoses of acute leukemia, as well as accurately predict cytogenetic and molecular aberrancies in blood and bone marrow samples using flow cytometry data. This automated workflow can significantly decrease diagnostic turn-around time and ultimately improve patient outcomes.

Multiple myeloma (MM) is a plasma cell neoplasm and the second most common hematologic malignancy. Current guidelines recommend multiple myeloma cases undergo karyotyping and fluorescence in situ hybridization (FISH) analysis on bone marrow biopsy samples to assess for specific recurrent genetic abnormalities. Results of this testing help to predict differences in patient prognosis over time and to guide appropriate selection of therapy. Refinements in risk stratification incorporated additional complex molecular testing which, along with karyotyping and FISH, are costly and not available to all patients. Myeloma cell cytomorphologies have been associated in a subset of cases with more aggressive disease, when plasmablastic, and with a subset of t(11;14) cases, when lymphoplasmacytoid. Nonetheless, cellular features as assessed by manual microscopy have not been shown to identify biologic subtypes of disease reproducibly and broadly, such that these features are not a component of current prognostic systems. Whether accurate risk-stratification information can be extracted from microscopic images of myeloma cells through machine-based approaches is uncertain. Thus, in this study, we tested the feasibility of using machine learning models to predict multiple myeloma molecular genetic subtypes from analysis of neoplastic plasma cell morphology on Wright-stained aspirate smears. We first improved upon a previously developed computational pipeline that identifies and classifies hematopoietic cells from scanned whole-slide images (400x) of bone marrow aspirate smears, including classification of plasma cells with 94% accuracy. Using this pipeline, we obtained images from all plasma cells (up to 68,569 plasma cells per slide) from aspirate smears of 18 patients without plasma cell neoplasms and 96 patients with plasma cell neoplasms (including 20 with t(11;14) and 23 with gain(1q); 44 with standard risk FISH and 22 with high risk FISH (t(4;14), t(14;16), del(17p), or &gt;4 copies 1q+)). We then developed convolutional neural network (CNN)-based multi-instance machine learning models to perform patient-level classifications of disease status and molecular genetic classification from image-level analysis of plasma cell cytomorphology. These models demonstrated strong performance for classifying patients without versus with plasma cell neoplasms (AUROC 0.80), and are capable of making patient-level predictions of multiple myeloma molecular genetic subtypes (t(11;14), AUROC 0.73; gain(1q), AUROC 0.71) as well as prediction of FISH risk level (AUROC 0.68). Finally, by assessing cell attention scores and CNN model weights, this machine learning pipeline can assess which individual plasma cells and which specific morphologic features provided the most utility in the prediction of molecular genetic subtypes for individual patients, hence providing explainability for model output. Importantly, our findings suggest that cytomorphologic features of plasma cells on routine aspirate smear slides contain information, extractable through machine-based approaches, that correlates with molecular genetic subtypes employed for risk stratification. With further refinement these promising computational digital pathology models can potentially yield tools not only for use in low resource settings, but also provide a potential basis for development of multimodal models that incorporate and improve upon results of currently used risk stratification tools.

Multi-instance Learning Research Articles

Related Topics

Articles published on Multi-instance Learning

Deep learning-enabled classification of kidney allograft rejection on whole slide histopathologic images.

Boosting Weakly Supervised Object Localization and Segmentation With Domain Adaption.

Weakly supervised large-scale pancreatic cancer detection using multi-instance learning.

SAFE-MIL: a statistically interpretable framework for screening potential targeted therapy patients based on risk estimation.

Fast Broad Multiview Multi-Instance Multilabel Learning (FBM3L) With Viewwise Intercorrelation.

A Robust Open-Set Multi-Instance Learning for Defending Adversarial Attacks in Digital Image

Explaining Anomalous Events in Flight Data of UAV With Deep Attention-Based Multi-Instance Learning

Automated identification of protein expression intensity and classification of protein cellular locations in mouse brain regions from immunofluorescence images.

DCAMIL: Eye-tracking guided dual-cross-attention multi-instance learning for refining fundus disease detection

Identifying Student Profiles Within Online Judge Systems Using Explainable Artificial Intelligence

A Causality-Driven Graph Convolutional Network for Postural Abnormality Diagnosis in Parkinsonians.

Regularized Optimal Transport Layers for Generalized Global Pooling Operations.

Automated Machine Learning-Based Diagnosis and Molecular Characterization of Acute Leukemias using Flow Cytometry Data

Masked autoencoders with handcrafted feature predictions: Transformer for weakly supervised esophageal cancer classification

A multi-instance multi-label learning algorithm based on radial basis functions and multi-objective particle swarm optimization

Classification of Emotional and Immersive Outcomes in the Context of Virtual Reality Scene Interactions.

Automated Deep Learning-Based Diagnosis and Molecular Characterization of Acute Myeloid Leukemia Using Flow Cytometry

Machine Learning Models Predict Molecular Genetic Subtypes of Multiple Myeloma from Whole-Slide Bone Marrow Aspirate Smears

Shared-Specific Feature Learning With Bottleneck Fusion Transformer for Multi-Modal Whole Slide Image Analysis.

Multi-Instance Learning Approach to the Modeling of Enantioselectivity of Conformationally Flexible Organic Catalysts.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Multi-instance Learning Research Articles

Related Topics

Articles published on Multi-instance Learning

Deep learning-enabled classification of kidney allograft rejection on whole slide histopathologic images.

Boosting Weakly Supervised Object Localization and Segmentation With Domain Adaption.

Weakly supervised large-scale pancreatic cancer detection using multi-instance learning.

SAFE-MIL: a statistically interpretable framework for screening potential targeted therapy patients based on risk estimation.

Fast Broad Multiview Multi-Instance Multilabel Learning (FBM3L) With Viewwise Intercorrelation.

A Robust Open-Set Multi-Instance Learning for Defending Adversarial Attacks in Digital Image

Explaining Anomalous Events in Flight Data of UAV With Deep Attention-Based Multi-Instance Learning

Automated identification of protein expression intensity and classification of protein cellular locations in mouse brain regions from immunofluorescence images.

DCAMIL: Eye-tracking guided dual-cross-attention multi-instance learning for refining fundus disease detection

Identifying Student Profiles Within Online Judge Systems Using Explainable Artificial Intelligence

A Causality-Driven Graph Convolutional Network for Postural Abnormality Diagnosis in Parkinsonians.

Regularized Optimal Transport Layers for Generalized Global Pooling Operations.

Automated Machine Learning-Based Diagnosis and Molecular Characterization of Acute Leukemias using Flow Cytometry Data

Masked autoencoders with handcrafted feature predictions: Transformer for weakly supervised esophageal cancer classification

A multi-instance multi-label learning algorithm based on radial basis functions and multi-objective particle swarm optimization

Classification of Emotional and Immersive Outcomes in the Context of Virtual Reality Scene Interactions.

Automated Deep Learning-Based Diagnosis and Molecular Characterization of Acute Myeloid Leukemia Using Flow Cytometry

Machine Learning Models Predict Molecular Genetic Subtypes of Multiple Myeloma from Whole-Slide Bone Marrow Aspirate Smears

Shared-Specific Feature Learning With Bottleneck Fusion Transformer for Multi-Modal Whole Slide Image Analysis.

Multi-Instance Learning Approach to the Modeling of Enantioselectivity of Conformationally Flexible Organic Catalysts.