Spatial transcriptomics allows for the measurement of high-throughput gene expression data while preserving the spatial structure of tissues and histological images. Integrating gene expression, spatial information, and image data to learn discriminative low-dimensional representations is critical for dissecting tissue heterogeneity and analyzing biological functions. However, most existing methods have limitations in effectively utilizing spatial information and high-resolution histological images. We propose a signal-diffusion-based unsupervised contrast learning method (SDUCL) for learning low-dimensional latent embeddings of cells/spots. SDUCL integrates image features, spatial relationships and gene expression information. We designed a signal diffusion microenvironment discovery algorithm, which effectively captures and integrates interaction information within the cellular microenvironment by simulating the biological signal diffusion process. By maximizing the mutual information between the local representation and the microenvironment representation of cells/spots, SDUCL learns more discriminative representations. SDUCL was employed to analyze spatial transcriptomics datasets from multiple species, encompassing both normal and tumor tissues. SDUCL performed well in downstream tasks such as clustering, visualization, trajectory inference, and differential gene analysis, thereby enhancing our understanding of tissue structure and tumor microenvironments. https://github.com/WeiMin-Li-visual/SDUCL. Supplementary data are available at Bioinformatics online.
Read full abstract