Abstract

Our study aimed to develop a machine learning (ML) model to accurately classify acute promyelocytic leukemia (APL) from other types of acute myeloid leukemia (other AML) using multicolor flow cytometry (MFC) data. Multicolor flow cytometry is used to determine immunophenotypes that serve as disease signatures for diagnosis. We used a data set of MFC files from 27 patients with APL and 41 patients with other AML, including those with uncommon immunophenotypes. Our ML pipeline involved training a graph neural network (GNN) to output graph-level labels and identifying the most crucial MFC parameters and cells for predictions using an input perturbation method. The top-performing GNN achieved 100% accuracy on the training/validation and test sets on classifying APL from other AML and used MFC parameters similarly to expert pathologists. Pipeline performance is amenable to use in a clinical decision support system, and our deep learning architecture readily enables prediction explanations. Our ML pipeline shows robust performance on predicting APL and could be used to screen for APL using MFC data. It also allowed for intuitive interrogation of the model's predictions by clinicians.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call