Visual–auditory learning network for construction equipment action detection

Seunghoon Jung, Dong‐Eun Lee, Hyounseung Jang, Jaewon Jeoung, Taehoon Hong

doi:10.1111/mice.12983

Abstract

Action detection of construction equipment is critical for tracking project performance, facilitating construction automation, and improving construction efficiency through site monitoring. In particular, the auditory signal can supplement computer vision‐based action detection for various types of construction equipment. This study therefore develops a visual–auditory learning network model for the action detection of construction equipment based on two modalities (i.e., vision and audition). To this end, a multi‐modal feature extractor extracts both visual and auditory features. In addition, a multi‐head attention and detection module conducts the localization and classification tasks in separate heads, with a different attention mechanism applied to each task: a content‐based attention mechanism for spatial attention in the localization head, and a dot‐product attention mechanism for channel attention in the classification head. The evaluation results show that the precision and recall of the proposed model reach 86.92% and 84.00%, respectively, with the adoption of the multi‐head attention and detection module, which improves overall detection performance by utilizing different correlations of visual and auditory features for localization and classification.
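To make the idea of dot‐product channel attention concrete, the following is a minimal sketch, not the authors' implementation. It assumes a hypothetical setup in which each visual channel is summarized by a short descriptor vector and a single auditory feature vector acts as the query; the function names, the shapes, and the scaling by the square root of the dimension are all illustrative assumptions that the abstract does not specify.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot_product_channel_attention(visual, audio):
    """Reweight visual channels by their dot-product similarity to an audio query.

    visual: list of C channel descriptors, each a list of D floats (hypothetical layout)
    audio:  list of D floats used as the query (hypothetical layout)
    Returns (weights, attended), where weights sum to 1 across channels.
    """
    dim = len(audio)
    # One score per channel: scaled dot product between channel descriptor and audio query.
    scores = [
        sum(v * a for v, a in zip(channel, audio)) / math.sqrt(dim)
        for channel in visual
    ]
    weights = softmax(scores)
    # Scale every channel descriptor by its attention weight.
    attended = [[w * v for v in channel] for w, channel in zip(weights, visual)]
    return weights, attended


# Toy example: three channels with two-dimensional descriptors.
visual = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
audio = [1.0, 0.0]
weights, attended = dot_product_channel_attention(visual, audio)
```

In this toy input, the first and third channels align with the audio query and so receive equal, larger weights than the second channel, which is orthogonal to it.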

Journal: Computer-Aided Civil and Infrastructure Engineering
Publication Date: Feb 23, 2023
Citations: 12

Similar Papers

One Spatio-Temporal Sharpening Attention Mechanism for Light-Weight YOLO Models Based on Sharpening Spatial Attention
Mengfan Xue ... Yunfei Guo
Sensors | VOL. 21
28 Nov 2021

AGCA: An Adaptive Graph Channel Attention Module for Steel Surface Defect Detection
Xin Xiang ... Zenghui Wang
IEEE Transactions on Instrumentation and Measurement | VOL. 72
01 Jan 2023

An Efficient Dual-Pooling Channel Attention Module for Weakly-Supervised Liver Lesion Classification on CT Images
Duong Linh Phung ... Rahul Kumar Jain
01 Jan 2023

Mixed local channel attention for object detection
Dahang Wan ... Zhijie Ren
Engineering Applications of Artificial Intelligence | VOL. 123
24 May 2023
