Geographic mapping with unsupervised multi-modal representation learning from VHR images and POIs

Lubin Bai, Weiming Huang, Xiuyuan Zhang, Shihong Du, Gao Cong, Haoyu Wang, Bo Liu

doi:10.1016/j.isprsjprs.2023.05.006

Abstract

Most supervised geographic mapping methods that use very-high-resolution (VHR) images are designed for a single task, which makes them heavily dependent on labels and hard to generalize across tasks. In addition, VHR images carry no socio-economic information, which limits their use in social and human-related geographic studies. To address both issues, we propose MMGR, an unsupervised multi-modal geographic representation learning framework that uses both VHR images and points-of-interest (POIs) to learn representations (regional vector embeddings) carrying both the physical and the socio-economic properties of geographic regions. MMGR employs an intra-modal and an inter-modal contrastive learning module: the former mines visual features by contrasting different augmentations of the same VHR image, while the latter fuses physical and socio-economic features by contrasting VHR image features with POI features. Extensive experiments are performed in two study areas (Shanghai and Wuhan, China) on three related but distinct geographic mapping tasks: mapping urban functional distributions, population density, and gross domestic product. The results show that MMGR considerably outperforms seven competitive baselines in all three tasks, which indicates its effectiveness in fusing VHR images and POIs for multiple geographic mapping tasks. Furthermore, MMGR is a competent pre-training method that helps image encoders understand multi-modal geographic information, and it can be further strengthened by fine-tuning with only a few labeled samples. The source code is released at https://github.com/bailubin/MMGR.
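The two contrastive modules described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the symmetric InfoNCE loss, the temperature of 0.1, the equal weighting of the two terms, and the random arrays standing in for encoder outputs are all assumptions made for illustration. The abstract does not specify the exact loss, so the released repository should be consulted for the real formulation.

```python
import numpy as np

def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length so dot products become cosine similarities."""
    return x / (np.linalg.norm(x, axis=1, keepdims=True) + eps)

def log_softmax(logits):
    """Numerically stable row-wise log-softmax."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))

def info_nce(a, b, temperature=0.1):
    """Symmetric InfoNCE loss (an assumed choice, not confirmed by the abstract).

    Row i of `a` and row i of `b` describe the same region (positive pair);
    every other row in the batch acts as a negative.
    """
    a, b = l2_normalize(a), l2_normalize(b)
    logits = a @ b.T / temperature           # (n, n) similarity matrix
    idx = np.arange(len(a))
    loss_ab = -log_softmax(logits)[idx, idx].mean()    # match a -> b
    loss_ba = -log_softmax(logits.T)[idx, idx].mean()  # match b -> a
    return 0.5 * (loss_ab + loss_ba)

# Hypothetical stand-ins for encoder outputs: 8 regions, 16-dim embeddings.
rng = np.random.default_rng(0)
n_regions, dim = 8, 16
img_view1 = rng.normal(size=(n_regions, dim))                       # augmentation 1
img_view2 = img_view1 + 0.05 * rng.normal(size=(n_regions, dim))    # augmentation 2
poi = rng.normal(size=(n_regions, dim))                             # untrained POI features

intra_loss = info_nce(img_view1, img_view2)  # intra-modal: image vs. image
inter_loss = info_nce(img_view1, poi)        # inter-modal: image vs. POI
total_loss = intra_loss + inter_loss         # equal weighting is an assumption
```

In this toy setup the two image views are nearly identical, so the intra-modal loss is already small, while the random POI features are unaligned with the image features and give a large inter-modal loss. Training would adjust both encoders to reduce that second term.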

Journal: ISPRS Journal of Photogrammetry and Remote Sensing
Publication Date: Jun 1, 2023
Citations: 16
