Enriching building function classification using Large Language Model embeddings of OpenStreetMap Tags

Abdulkadir Memduhoğlu,Nir Fulman,Alexander Zipf

doi:10.1007/s12145-024-01463-8

Abstract

Automated methods for building function classification are essential due to restricted access to official building use data. Existing approaches utilize traditional Natural Language Processing (NLP) techniques to analyze textual data representing human activities, but they struggle with the ambiguity of semantic contexts. In contrast, Large Language Models (LLMs) excel at capturing the broader context of language. This study presents a method that uses LLMs to interpret OpenStreetMap (OSM) tags, combining them with physical and spatial metrics to classify urban building functions. We employed an XGBoost model trained on 32 features from six city datasets to classify urban building functions, demonstrating varying F1 scores from 67.80% in Madrid to 91.59% in Liberec. Integrating LLM embeddings enhanced the model's performance by an average of 12.5% across all cities compared to models using only physical and spatial metrics. Moreover, integrating LLM embeddings improved the model's performance by 6.2% over models that incorporate OSM tags as one-hot encodings, and when predicting based solely on OSM tags, the LLM approach outperforms traditional NLP methods in 5 out of 6 cities. These results suggest that deep contextual understanding, as captured by LLM embeddings more effectively than traditional NLP approaches, is beneficial for classification. Finally, a Pearson correlation coefficient of approximately -0.858 between population density and F1-scores suggests that denser areas present greater classification challenges. Moving forward, we recommend investigation into discrepancies in model performance across and within cities, aiming to identify generalized models.

Full Text

Published Version

View

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

Enriching building function classification using Large Language Model embeddings of OpenStreetMap Tags

Abstract

Published Version

Talk to us

Similar Papers

More From: Earth Science Informatics

Lead the way for us

Journal: Earth Science Informatics	Publication Date: Aug 27, 2024
License type: CC BY 4.0

Similar Papers

#2924 Comparison of large language models and traditional natural language processing techniques in predicting arteriovenous fistula failure
Suman Lama ... Luca Neri
Nephrology Dialysis Transplantation | VOL. 39
Suman Lama, et. al.Suman Lama ... Luca Neri
23 May 2024
Nephrology Dialysis Transplantation | VOL. 39

Harnessing LLMs for Financial Forecasting: A Systematic Review of Advances in Stock Market Prediction and Portfolio Optimization
Prof Maheshwari Divate ... Krushna Darak
International Journal for Research in Applied Science and Engineering Technology | VOL. 12
Prof Maheshwari Divate, et. al.Prof Maheshwari Divate ... Krushna Darak
30 Nov 2024
International Journal for Research in Applied Science and Engineering Technology | VOL. 12

Entity Extraction of Key Elements in 110 Police Reports Based on Large Language Models
Xintao Xing ... Peng Chen
Applied Sciences | VOL. 14
Xintao Xing, et. al.Xintao Xing ... Peng Chen
03 Sep 2024
Applied Sciences | VOL. 14

Performance of Large Language Models on a Neurology Board–Style Examination
Marc Cicero Schubert ... Varun Venkataramani
JAMA network open | VOL. 6
Marc Cicero Schubert, et. al.Marc Cicero Schubert ... Varun Venkataramani
07 Dec 2023
JAMA network open | VOL. 6

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Enriching building function classification using Large Language Model embeddings of OpenStreetMap Tags

Abstract

Published Version

Talk to us

Similar Papers

More From: Earth Science Informatics