Abstract

A deep understanding of the cause-effect relationship of bridge damages provides an opportunity to design, construct, and maintain bridge structures more effectively. The damage factors (i.e., bridge element, damage, and cause) and their complex relationships can be extracted from bridge inspection reports; however, it is not practical to manually read a considerable number of inspection documents and extract such valuable information. Although existing studies attempted to automatically analyze inspection reports, they require a large amount of human effort for data labeling and model development. To overcome the limitations, the authors propose an efficient information acquisition approach that extracts damage factors and causal relationships from bridge inspection reports. The named entity recognition (NER) model was developed based on a recurrent neural network (RNN) and was trained with the active learning method. In the experiments performed with 1,650 sentences (i.e., 1,300 for training and 350 for testing), the developed model successfully classified categories of text words (i.e., damage factors) and captured their causal relationship with 0.927 accuracy and 0.860 F1 score. Besides, the active learning method could significantly reduce the human effort required for data labeling and model development. The developed model achieved 0.778 F1 score only using 140 sentences, requiring less than an hour for manual labeling. These results meant that the model was able to successfully extract major damage factors and their cause-effect relationships from a set of text sentences with little effort. Consequently, the findings of this study can help field engineers to design, construct, and maintain bridge structures.

Highlights

  • Bridges play a key role in maintaining traffic flow and contribute to the economic development of a society; it is essential to secure bridge infrastructure safety for public safety and the national economy (Deng et al 2016; LeBeau and Wadia-Fascetti 2007)

  • The causal information would benefit bridge inspectors and managers. They can determine the order of inspection priority; which bridge has the most urgent and serious causes? It should be noted that, according to data availability, this paper focuses on bridge inspection reports written in the Korean language

  • As the reports are stored in portable document format (PDF) formats, plain text data were extracted from the PDF files using the pdftotext, which is an open-source text extraction tool (Noonburg 2017)

Read more

Summary

Introduction

Bridges play a key role in maintaining traffic flow and contribute to the economic development of a society; it is essential to secure bridge infrastructure safety for public safety and the national economy (Deng et al 2016; LeBeau and Wadia-Fascetti 2007). Many studies have paid attention to analyze the cause-effect relationship of bridge damages from the buried information in inspection reports (Jeon et al 2017; Lokuge et al 2016; Peris-Sayol et al 2017). The considerable number of bridge inspection reports makes it impractical to collect, read, and utilize such valuable information manually (Liu and El-Gohary 2017; Ryu and Shin 2014). Since the text is written in unstructured natural language, bridge engineers should manually read and understand every text to extract the historical cause-effect information from the reports. Considering that there are numerous text sentences in one inspection report, an automated information extraction method is needed to retrieve valuable information from a significant number of bridge inspection reports.

Literature Review
Experimental Setup
10 Girder
Results and Discussion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call