An Artificial Intelligence (AI) Approach for Automated Flow Cytometric Diagnosis of B-Cell Lymphoma

Alexander Höllein,Max Zhao,Richard Schabath,Torsten Haferlach,Claudia Haferlach,Peter Krawitz,Wolfgang Kern

doi:10.1182/blood-2018-99-113797

Alexander Höllein, Max Zhao + Show 5 more

Open Access

https://doi.org/10.1182/blood-2018-99-113797

Copy DOI

Abstract

Background: Multi-parameter flow cytometry (MFC) data is a diagnostic keystone for hematologic malignancies. Data analysis and interpretation requires resources and expertise, full automation is desired. Combination of automated gating strategies and machine learning on MFC data proved effective in detecting AML cell populations vs normal (Aghaeepour et al., 2013). Aim of our approach was to create tools capable of comprehensively analyzing and classifying MFC data. We developed a proof of principle framework to process MFC data generated from samples with suspected B-cell lymphomas.Methods: MFC data was obtained from routine diagnostics of 16,384 patients as follows: normal controls (n=8,493), chronic lymphocytic leukemia (CLL, n=3,412), CLL with increased prolymphocytes (CLL/PL, n=603), follicular lymphoma (FL, n=219), hairy cell leukemia (HCL, n=193), lymphoplasmacytic lymphoma (LPL, n=629), mantle cell lymphoma (MCL, n=269), marginal zone lymphoma (MZL, n=979), monoclonal B-cell lymphocytosis (MBL, n=1,480). Two 9-color combinations of monoclonal antibodies were applied to analyze surface expression of the following antigens besides scatter signals: FMC7, CD10, IgM, CD79b, CD20, CD23, CD19, CD5, CD45, Kappa, Lambda, CD38, CD25, CD11c, CD103, CD22. For dimensionality reduction we used an unsupervised clustering approach (Kohonen et al., 1990). We first generated a consensus self-organizing map (CS) of 100 nodes including all MFC events of multiple preselected cases (5 samples/cohort). For a test case the fluorescent profile of a cell was assigned to its nearest node in the CS (upsampling). We used the resulting distribution for classification by a sequential neural network (NN). Computationally intensive processing steps were run using Amazon Web Services. Mean accuracies were calculated counting individual cases across the entire dataset. All accuracies are described for 10 CS generations and 10-fold cross validation in the classification process. Approaches of data processing were sequentially developed aiming at maximization of classification accuracy.Results: First, we assessed the performance in a binary classification, i.e. the merged group of CLL and MBL (n=4,877) vs. normal (n=8,493) achieving 97% accuracy. Secondly, we applied our approach to the classification of all 9 classes with an accuracy of 74%. Visual inspection of tSNE (van der Maaten et al., 2008) transformed upsampling information revealed good separability between CLL and normal cases but substantial overlap between other lymphoma subtypes (Figure). The following approaches were applied to improve classification:1) Automated pregating of lymphocytes (CD45/SSC) by DBSCAN analysis (Ester et al., 1996) improved classification. Applying this to both CS generation and upsampling the accuracy of the 9-class problem increased to 78% (p<0.001).2) Using a CS trained only on cases with unequivocal phenotypes without pregating was significantly better (p=0.01), but pregating overrode this effect (p=0.5).3) According to the partial overlap of immunophenotypes hierarchical clustering of misclassifications showed overlap mainly between MCL and CLL/PL as well as between LPL and MZL. We therefore changed our approach merging the groups CLL/MBL, MCL/CLL/PL and LPL/MZL while keeping FL, HCL and normal controls separate. Automated classification of these 6 classes resulted in 86% accuracy.4) Further regrouping into 3 classes, according to the commonly applied routine approach of distinguishing normal cases from CD5+ (CLL, MBL, MCL, CLL/PL) and CD5- B-cell lymphoma (FL, LPL, MZL, HCL) resulted in a classification accuracy of 89%.5) The misclassified cases in the 3-class experiment had an average infiltration of 10% versus 41% for correctly classified ones. Limiting the automated classification to cases with ≥5% infiltration increased the classification accuracy to 95%.Conclusion: We created a framework for automated classification and analysis of routine diagnostic MFC data. Definition of entities and preselection of cell populations are essential for optimizing classification accuracy. Our approach is the first application of computational MFC and AI on a large scale dataset and proves the feasibility in a routine diagnostic work flow. To further improve the accuracy of classification a future algorithm will integrate distance metrics following SOM generation into the NN classifier. [Display omitted] DisclosuresHöllein:MLL Munich Leukemia Laboratory: Employment. Schabath:MLL Munich Leukemia Laboratory: Employment. Haferlach:MLL Munich Leukemia Laboratory: Employment, Equity Ownership. Haferlach:MLL Munich Leukemia Laboratory: Employment, Equity Ownership. Kern:MLL Munich Leukemia Laboratory: Employment, Equity Ownership.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An Artificial Intelligence (AI) Approach for Automated Flow Cytometric Diagnosis of B-Cell Lymphoma

Abstract

Talk to us

Similar Papers

More From: Blood

Lead the way for us

Similar Papers

Cyclin D1 (CCND1) mRNA Expression Assessed by Real-Time PCR Is Increased in Patients with Mantle Cell Lymphoma and Contributes to Diagnosis in B-Cell Lymphoma Cases
Ulrike Bacher ... Susanne Schnittger
Blood | VOL. 120
Ulrike Bacher, et. al.Ulrike Bacher ... Susanne Schnittger
16 Nov 2012
Blood | VOL. 120

Evaluation of BRAFV600E, NRAS and KRAS Mutations As Well As IGHV Usage for Diagnostic Use in Hairy Cell Leukemia, Hairy Cell Leukemia-Variant and Splenic Marginal Zone Lymphoma.
Susanne Schnittger ... Wolfgang Kern
Blood | VOL. 120
Susanne Schnittger, et. al.Susanne Schnittger ... Wolfgang Kern
16 Nov 2012
Blood | VOL. 120

Characterization of MYD88 mutated Lymphoplasmacytic Lymphoma in Comparison to MYD88 mutated Chronic Lymphocytic Leukemia
Constance Regina Baer ... Claudia Haferlach
Blood | VOL. 126
Constance Regina Baer, et. al.Constance Regina Baer ... Claudia Haferlach
03 Dec 2015
Blood | VOL. 126

Cytogenetic Aberrations In Lymphoplasmacytic Lymphoma (LPL): A Study On 166 Cases
Wolfgang Kern ... Claudia Haferlach
Blood | VOL. 116
Wolfgang Kern, et. al.Wolfgang Kern ... Claudia Haferlach
19 Nov 2010
Blood | VOL. 116

Journal: Blood	Publication Date: Nov 29, 2018
Citations: 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Artificial Intelligence (AI) Approach for Automated Flow Cytometric Diagnosis of B-Cell Lymphoma

Abstract

Talk to us

Similar Papers

More From: Blood