Partial label learning for automated classification of single-cell transcriptomic profiles.

Malek Senoussi,Paul Villoutreix,Thierry Artieres

doi:10.1371/journal.pcbi.1012006

Abstract

Single-cell RNA sequencing (scRNASeq) data plays a major role in advancing our understanding of developmental biology. An important current question is how to classify transcriptomic profiles obtained from scRNASeq experiments into the various cell types and identify the lineage relationship for individual cells. Because of the fast accumulation of datasets and the high dimensionality of the data, it has become challenging to explore and annotate single-cell transcriptomic profiles by hand. To overcome this challenge, automated classification methods are needed. Classical approaches rely on supervised training datasets. However, due to the difficulty of obtaining data annotated at single-cell resolution, we propose instead to take advantage of partial annotations. The partial label learning framework assumes that we can obtain a set of candidate labels containing the correct one for each data point, a simpler setting than requiring a fully supervised training dataset. We study and extend when needed state-of-the-art multi-class classification methods, such as SVM, kNN, prototype-based, logistic regression and ensemble methods, to the partial label learning framework. Moreover, we study the effect of incorporating the structure of the label set into the methods. We focus particularly on the hierarchical structure of the labels, as commonly observed in developmental processes. We show, on simulated and real datasets, that these extensions enable to learn from partially labeled data, and perform predictions with high accuracy, particularly with a nonlinear prototype-based method. We demonstrate that the performances of our methods trained with partially annotated data reach the same performance as fully supervised data. Finally, we study the level of uncertainty present in the partially annotated data, and derive some prescriptive results on the effect of this uncertainty on the accuracy of the partial label learning methods. Overall our findings show how hierarchical and non-hierarchical partial label learning strategies can help solve the problem of automated classification of single-cell transcriptomic profiles, interestingly these methods rely on a much less stringent type of annotated datasets compared to fully supervised learning methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Partial label learning for automated classification of single-cell transcriptomic profiles.

Abstract

Talk to us

Similar Papers

More From: PLOS Computational Biology

Lead the way for us

Journal: PLOS Computational Biology	Publication Date: Apr 5, 2024
License type: CC BY 4.0

Similar Papers

Partial Label Learning with Batch Label Correction
Yan Yan ... Yuhong Guo
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 34
Yan Yan, et. al.Yan Yan ... Yuhong Guo
03 Apr 2020
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 34

Towards Enabling Binary Decomposition for Partial Label Learning
Xuan Wu ... Min-Ling Zhang
-
Xuan Wu, et. al.Xuan Wu ... Min-Ling Zhang
01 Jul 2018
01 Jul 2018

A Generative Model for Partial Label Learning
Yan Yan ... Shining Li
-
Yan Yan, et. al.Yan Yan ... Shining Li
05 Jul 2021
05 Jul 2021

Learning with Noisy Partial Labels by Simultaneously Leveraging Global and Local Consistencies
Changchun Li ... Jihong Ouyang
-
Changchun Li, et. al.Changchun Li ... Jihong Ouyang
19 Oct 2020
19 Oct 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Partial label learning for automated classification of single-cell transcriptomic profiles.

Abstract

Talk to us

Similar Papers

More From: PLOS Computational Biology