Recent research in spatial transcriptomics allows researchers to analyze gene expression without losing spatial information. Spatial information can assist in cell communication, identification of new cell subtypes, which provides important research methods for multiple fields such as microenvironment interactions and pathological processes of diseases. Identifying spatial domains is an important step in spatial transcriptomics analysis, and improving spatial clustering methods can benefit for identifying spatial domains. In addition to eliminating noise in original gene expression, how to use spatial information to assist clustering has also become a new problem. A variety of calculating methods have been applied to spatial clustering, including contrastive learning methods. However, existing spatial clustering methods based on contrastive learning use data augmentation to generate positive and negative pairs, which will inevitably destroy the biological meaning of the data. We propose a new self-supervised spatial clustering method based on contrastive learning, Augmentation-Free Spatial Clustering (AFSC), which integrates spatial information and gene expression to learn latent representations. We construct a contrastive learning module without negative pairs or data augmentation by designing Teacher and Student Encoder. We also design an unsupervised clustering module to make clustering and contrastive learning be trained together. Experiments on multiple spatial transcriptomics datasets at different resolutions demonstrate that our method performs well in self-supervised spatial clustering tasks. Furthermore, the learned representations can be used for various downstream tasks including visualization and trajectory inference.
Read full abstract