Abstract

Acute Myeloid Leukaemia (AML) is a phenotypically and genetically heterogenous blood cancer characterised by very poor prognosis, with disease relapse being the primary cause of treatment failure. AML heterogeneity arise from different genetic and non-genetic sources, including its proposed hierarchical structure, with leukemic stem cells (LSCs) and progenitors giving origin to a variety of more mature leukemic subsets. Recent advances in single-cell molecular and phenotypic profiling have highlighted the intra and inter-patient heterogeneous nature of AML, which has so far limited the success of cell-based immunotherapy approaches against single targets. Machine Learning (ML) can be uniquely used to find non-trivial patterns from high-dimensional datasets and identify rare sub-populations. Here we review some recent ML tools that applied to single-cell data could help disentangle cell heterogeneity in AML by identifying distinct core molecular signatures of leukemic cell subsets. We discuss the advantages and limitations of unsupervised and supervised ML approaches to cluster and classify cell populations in AML, for the identification of biomarkers and the design of personalised therapies.

Highlights

  • Acute Myeloid Leukaemia (AML) is an aggressive and fast-progressing leukaemia characterised by the accumulation of myeloid progenitors [1]

  • leukemic stem cells (LSCs) lie at the top of AML cellular hierarchies, and carry an unlimited ability to self-renew as well as giving origin to a variety of more mature leukemic subsets [1], each expressing characteristic patterns of cell surface markers

  • Machine Learning (ML) has not reached its full potential for the characterisation of AML cell populations at single-cell resolution, partly due to the recent development of large datasets [5, 15,16,17,18]

Read more

Summary

INTRODUCTION

AML is an aggressive and fast-progressing leukaemia characterised by the accumulation of myeloid progenitors [1]. The output of the model is a weighted list of feature genes characteristic of every cluster that often include known markers for a given cell type and could potentially be used to detect common biomarkers of leukemic cell subsets from AML patients. Another unsupervised method, single-cell consensus clustering (SC3) uses the first 4-7% * N (number of cells) eigenvectors to build multiple k-means clustering solutions [21]. When applied to a multiomics dataset generated from human bone marrow samples [45], it showed that the combination of surface proteins and gene

Method
Findings
DISCUSSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call