Multimodal Attention Network for Trauma Activity Recognition from Spoken Language and Environmental Sound.

Yue Gu,Ivan Marsic,Shuhong Chen,Ruiyu Zhang,Jalal Abdulbaqi,Megan Cheng,Randall S Burd,Xinwei Zhao

doi:10.1109/ichi.2019.8904713

Abstract

Trauma activity recognition aims to detect, recognize, and predict the activities (or tasks) during a trauma resuscitation. Previous work has mainly focused on using various sensor data including image, RFID, and vital signals to generate the trauma event log. However, spoken language and environmental sound, which contain rich communication and contextual information necessary for trauma team cooperation, are still largely ignored. In this paper, we propose a multimodal attention network (MAN) that uses both verbal transcripts and environmental audio stream as input; the model extracts textual and acoustic features using a multi-level multi-head attention module, and forms a final shared representation for trauma activity classification. We evaluated the proposed architecture on 75 actual trauma resuscitation cases collected from a hospital. We achieved 72.4% accuracy with 0.705 F1 score, demonstrating that our proposed architecture is useful and efficient. These results also show that using spoken language and environmental audio indeed helps identify hard-to-recognize activities, compared to previous approaches. We also provide a detailed analysis of the performance and generalization of the proposed multimodal attention network.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multimodal Attention Network for Trauma Activity Recognition from Spoken Language and Environmental Sound.

Abstract

Talk to us

Similar Papers

More From: IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics

Lead the way for us

Journal: IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics	Publication Date: Jun 1, 2019
Citations: 8

Similar Papers

Multimodal Attention with Image Text Spatial Relationship for OCR-Based Image Captioning
Jing Wang ... Jinhui Tang
-
Jing Wang, et. al.Jing Wang ... Jinhui Tang
12 Oct 2020
12 Oct 2020

Robust footstep identification system based on acoustic local features
Fangxia Guo ... Xuan Wang
IET Biometrics | VOL. 6
Fangxia Guo, et. al.Fangxia Guo ... Xuan Wang
14 Mar 2017
IET Biometrics | VOL. 6

Semantic segmentation of remote sensing images based on dilated convolution and spatial-channel attention mechanism
Huazhong Jin ... Xueli Chang
Journal of Applied Remote Sensing | VOL. 17
Huazhong Jin, et. al.Huazhong Jin ... Xueli Chang
29 Mar 2023
Journal of Applied Remote Sensing | VOL. 17

The Golden Opportunity: Multidisciplinary Simulation Training Improves Trauma Team Efficiency
Andrea M Long ... Jeffrey E Carter
Journal of Surgical Education | VOL. 76
Andrea M Long, et. al.Andrea M Long ... Jeffrey E Carter
31 Jan 2019
Journal of Surgical Education | VOL. 76

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multimodal Attention Network for Trauma Activity Recognition from Spoken Language and Environmental Sound.

Abstract

Talk to us

Similar Papers

More From: IEEE International Conference on Healthcare Informatics. IEEE International Conference on Healthcare Informatics