The liver performs several vital functions such as metabolism, toxin removal, and glucose storage through the coordination of various cell types. With the recent breakthrough of the single-cell/single-nucleus RNA-seq (sc/snRNA-seq) techniques, there is a great opportunity to establish a reference cell map of the liver at single-cell resolution with transcriptome-wise features. In this study, we build a unified liver cell atlas uniLIVER (http://lifeome.net/database/uniliver) by integrative analysis of a large-scale sc/snRNA-seq data collection of normal human liver with 331,125 cells and 79 samples from 6 datasets. Moreover, we introduce LiverCT, a novel machine learning based method for mapping any query dataset to the liver reference map by introducing the definition of "variant" cellular states analogy to the sequence variants in genomic analysis. Applying LiverCT on liver cancer datasets, we find that the "deviated" states of T cells are highly correlated with the stress pathway activities in hepatocellular carcinoma, and the enrichments of tumor cells with the hepatocyte-cholangiocyte "intermediate" states significantly indicate poor prognosis. Besides, we find the tumor cells of different patients have different zonation tendencies and this zonation tendency is also significantly associated with the prognosis. This reference atlas mapping framework can also be extended to any other tissues.
Read full abstract