Abstract
Traditional methods for assessing urban waterlogging hazards lack real-time efficiency. This study develops a rapid hazard assessment method for urban waterlogging events using social media data and large language models, with the urban waterlogging event in Guilin in 2024 as a case study. We constructed a keyword database for urban waterlogging, combined with web crawling techniques to collect related data from social media platforms. This study applied the BERT-BILSTM-CRF model, the GPT-3.5-turbo model, and the ERNIE-3.5-128K model to extract waterlogging locations, comparing them with the official waterlogging location data to validate and select the optimal model. The model was then used for urban waterlogging hazard assessment at the township scale. The results indicate that the establishment of a keyword database facilitates access to more social media data. The GPT-3.5-turbo model demonstrates superior performance in waterlogging location identification, with a coverage of 93% compared with the actual waterlogging locations, and an F1 score of 0.8066. During the Guilin 6.19 waterlogging, areas with high hazard intensities mainly concentrated in the city center; meanwhile, the hazard intensities of some townships in the suburbs intensified from 17 June to 21 June. This study validates the effectiveness and timeliness of using social media data and large language models for rapid hazard assessment of urban waterlogging events through practical cases, which can provide reliable support for future urban disaster prevention and reduction efforts.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have