Automatic bridge inspection database construction through hybrid information extraction and large language models

Chenhong Zhang,Xiaoming Lei,Ye Xia,Limin Sun

doi:10.1016/j.dibe.2024.100549

Abstract

Regular bridge inspections generate extensive reports that, while critical for maintenance, often remain underutilized due to their unstructured format. Traditional information extraction methods depend on intricate labeling systems that commonly require time-consuming and labor-intensive labeling. This paper presents a novel bridge inspection database construction method leveraging LLM-assisted information extraction. First, we introduce the pseudo-labelling method using a closed-source LLM to generate high-quality data. Then we propose the hybrid extraction pipeline to extract relevant information segments and process them by a generation-based IE model, fine-tuned on pseudo-labeled data. Finally, the extracted data is used to construct the bridge inspection database. The proposed method, validated with real-world data, not only demonstrates higher extraction precision than the closed-source LLM used for pseudo-labeling but also outperforms traditional methods in both data preparation time and extraction accuracy. This approach provides a scalable solution for more proactive and data-driven bridge maintenance strategies.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Automatic bridge inspection database construction through hybrid information extraction and large language models

Abstract

Talk to us

Similar Papers

More From: Developments in the Built Environment

Lead the way for us

Journal: Developments in the Built Environment	Publication Date: Oct 1, 2024
License type: cc-by-nc

Similar Papers

Health Care Language Models and Their Fine-Tuning for Information Extraction: Scoping Review.
Miguel Nunes ... Luis B Elvas
JMIR medical informatics | VOL. 12
Miguel Nunes, et. al.Miguel Nunes ... Luis B Elvas
21 Oct 2024
JMIR medical informatics | VOL. 12

Redefining Health Care Data Interoperability: Empirical Exploration of Large Language Models in Information Exchange.
Dukyong Yoon ... Yujin Choi
Journal of medical Internet research | VOL. 26
Dukyong Yoon, et. al.Dukyong Yoon ... Yujin Choi
22 Jan 2024
Journal of medical Internet research | VOL. 26

Quantifying the uncertainty of LLM hallucination spreading in complex adaptive social networks
Guozhi Hao ... Rosario Morello
Scientific Reports | VOL. 14
Guozhi Hao, et. al.Guozhi Hao ... Rosario Morello
16 Jul 2024
Scientific Reports | VOL. 14

Automating Information Retrieval from Biodiversity Literature Using Large Language Models: A Case Study
Vamsi Krishna Kommineni ... Birgitta Koenig-Ries
Biodiversity Information Science and Standards | VOL. 8
Vamsi Krishna Kommineni, et. al.Vamsi Krishna Kommineni ... Birgitta Koenig-Ries
10 Sep 2024
Biodiversity Information Science and Standards | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Automatic bridge inspection database construction through hybrid information extraction and large language models

Abstract

Talk to us

Similar Papers

More From: Developments in the Built Environment