Skip to main navigation Skip to search Skip to main content

Towards a foundation model for geospatial artificial intelligence (vision paper)

  • Gengchen Mai
  • , Chris Cundy
  • , Kristy Choi
  • , Yingjie Hu
  • , Ni Lao
  • , Stefano Ermon
  • Stanford University
  • Alphabet Inc.

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

59 Scopus citations

Abstract

Large pre-trained models, also known as foundation models (FMs), are trained in a task-agnostic manner on large-scale data and can be adapted to a wide range of downstream tasks by fine tuning, few-shot, or even zero-shot learning. Despite their successes in language and vision tasks, we have yet to see an attempt to develop foundation models for geospatial artificial intelligence (GeoAI). In this work, we explore the promises and challenges for developing multimodal foundation models for GeoAI. We first show the advantages of this idea by testing the performance of existing Large pre-trained Language Models (LLMs) (e.g. GPT-2 and GPT-3) on two geospatial semantics tasks. Results indicate that these task-agnostic LLMs can outperform task-specific fully-supervised models on both tasks with 2-9% improvement in a few-shot learning setting. However, we also show the limitations of these existing foundation models given the multimodality nature of GeoAI, especially when dealing with geometries in conjunction with other modalities. So we discuss the possibility of a multimodal foundation model which can reason over various types of geospatial data through geospatial alignments. We conclude this paper by discussing the unique risks and challenges to develop such model for GeoAI.

Original languageEnglish
Title of host publication30th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2022
EditorsMatthias Renz, Mohamed Sarwat, Mario A. Nascimento, Shashi Shekhar, Xing Xie
PublisherAssociation for Computing Machinery
ISBN (Electronic)9781450395298
DOIs
StatePublished - Nov 1 2022
Event30th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, SIGSPATIAL GIS 2022 - Seattle, United States
Duration: Nov 1 2022Nov 4 2022

Publication series

NameGIS: Proceedings of the ACM International Symposium on Advances in Geographic Information Systems

Conference

Conference30th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, SIGSPATIAL GIS 2022
Country/TerritoryUnited States
CitySeattle
Period11/1/2211/4/22

Keywords

  • foundation models
  • geospatial artificial intelligence
  • large language models

Fingerprint

Dive into the research topics of 'Towards a foundation model for geospatial artificial intelligence (vision paper)'. Together they form a unique fingerprint.

Cite this