Abstract

In damage-level classification, deep learning models are more likely to focus on regions unrelated to the classification target because of the complexities inherent in real data, such as the diversity of damage types (e.g., cracks, efflorescence, and corrosion). This causes performance degradation. Solving this problem requires handling both data complexity and uncertainty. This study proposes a multimodal deep learning model that can focus on damaged regions by using text data related to the damage in images, such as materials and components. Furthermore, by adjusting the influence of the attention maps on damage-level classification according to the confidence computed when estimating these maps, the proposed method achieves accurate damage-level classification. Our contribution is a model with an end-to-end multimodal attention mechanism that simultaneously considers text data, image data, and the confidence of the attention map. Finally, experiments on real images validate the effectiveness of the proposed method.
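The mechanism the abstract describes — a text-guided attention map whose influence on classification is scaled by an estimated confidence — can be illustrated with a minimal sketch. This is not the authors' architecture; all function names, the similarity-based attention, and the entropy-based confidence measure are illustrative assumptions.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def confidence_weighted_attention(img_feats, text_emb):
    """Hypothetical sketch of confidence-weighted multimodal attention.

    img_feats: (N, D) feature vectors for N image regions
    text_emb:  (D,)   embedding of damage-related text (e.g., material, component)
    """
    # Attention map: similarity between each image region and the text embedding
    scores = img_feats @ text_emb              # (N,)
    attn = softmax(scores)                     # attention map over regions

    # Confidence proxy (assumption): a peaked map (low entropy) -> high confidence
    entropy = -np.sum(attn * np.log(attn + 1e-12))
    conf = 1.0 - entropy / np.log(len(attn))   # normalized to [0, 1]

    # Scale the attention map's effect by its confidence: blend attended
    # pooling with plain average pooling of the image features
    attended = attn @ img_feats                # (D,) attention-weighted pooling
    uniform = img_feats.mean(axis=0)           # (D,) confidence-free fallback
    fused = conf * attended + (1.0 - conf) * uniform
    return fused, attn, conf
```

The fused feature would then feed a damage-level classifier; when the attention map is unreliable, the blend falls back toward an unweighted summary of the image.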

