On the Opportunities and Challenges of Foundation Models for GeoAI (Vision Paper)

Gengchen Mai,Chris Cundy,Ziyuan Li,Song Gao,Tianming Liu,Yingjie Hu,Weiming Huang,Gao Cong,Ninghao Liu,Deepak Mishra,Suhang Song,Ni Lao,Rui Zhu,Jin Sun

doi:10.1145/3653070

Abstract

Large pre-trained models, also known as foundation models (FMs), are trained in a task-agnostic manner on large-scale data and can be adapted to a wide range of downstream tasks by fine-tuning, few-shot, or even zero-shot learning. Despite their successes in language and vision tasks, we have not yet seen an attempt to develop foundation models for geospatial artificial intelligence (GeoAI). In this work, we explore the promises and challenges of developing multimodal foundation models for GeoAI. We first investigate the potential of many existing FMs by testing their performances on seven tasks across multiple geospatial domains, including Geospatial Semantics, Health Geography, Urban Geography, and Remote Sensing. Our results indicate that on several geospatial tasks that only involve text modality, such as toponym recognition, location description recognition, and US state-level/county-level dementia time series forecasting, the task-agnostic large learning models (LLMs) can outperform task-specific fully supervised models in a zero-shot or few-shot learning setting. However, on other geospatial tasks, especially tasks that involve multiple data modalities (e.g., POI-based urban function classification, street view image–based urban noise intensity classification, and remote sensing image scene classification), existing FMs still underperform task-specific models. Based on these observations, we propose that one of the major challenges of developing an FM for GeoAI is to address the multimodal nature of geospatial tasks. After discussing the distinct challenges of each geospatial data modality, we suggest the possibility of a multimodal FM that can reason over various types of geospatial data through geospatial alignments. We conclude this article by discussing the unique risks and challenges to developing such a model for GeoAI.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

On the Opportunities and Challenges of Foundation Models for GeoAI (Vision Paper)

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Spatial Algorithms and Systems

Lead the way for us

Journal: ACM Transactions on Spatial Algorithms and Systems	Publication Date: Jun 30, 2024
Citations: 11

Similar Papers

Deep Semantic-Visual Alignment for zero-shot remote sensing image scene classification
Wenjia Xu ... Yirong Wu
ISPRS Journal of Photogrammetry and Remote Sensing | VOL. 198
Wenjia Xu, et. al.Wenjia Xu ... Yirong Wu
14 Mar 2023
ISPRS Journal of Photogrammetry and Remote Sensing | VOL. 198

Extraction of Substance Use Information From Clinical Notes: Generative Pretrained Transformer-Based Investigation.
Fatemeh Shah-Mohammadi ... Joseph Finkelstein
JMIR medical informatics | VOL. 12
Fatemeh Shah-Mohammadi, et. al.Fatemeh Shah-Mohammadi ... Joseph Finkelstein
19 Aug 2024
JMIR medical informatics | VOL. 12

An Open Set Domain Adaptation Algorithm via Exploring Transferability and Discriminability for Remote Sensing Image Scene Classification
Jun Zhang ... Bin Pan
IEEE Transactions on Geoscience and Remote Sensing | VOL. 60
Jun Zhang, et. al.Jun Zhang ... Bin Pan
01 Jan 2021
IEEE Transactions on Geoscience and Remote Sensing | VOL. 60

CLRS: Continual Learning Benchmark for Remote Sensing Image Scene Classification.
Haifeng Li ... Chao Tao
Sensors | VOL. 20
Haifeng Li, et. al.Haifeng Li ... Chao Tao
24 Feb 2020
Sensors | VOL. 20

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

On the Opportunities and Challenges of Foundation Models for GeoAI (Vision Paper)

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Spatial Algorithms and Systems