Multimodal fall detection for solitary individuals based on audio-video decision fusion processing

Shiqin Jiao, Guoqi Li, Guiyang Zhang, Jiahao Zhou, Jihong Li

doi:10.1016/j.heliyon.2024.e29596

Abstract

Falls pose significant safety risks to individuals who live alone, especially the elderly, and a fast, efficient fall detection system is an effective way to address this hidden danger. We propose a multimodal method based on audio and video. Using only non-intrusive equipment, it reduces, to some extent, the false negatives that the most commonly used video-based methods can produce when lighting is insufficient or the person is outside the monitored range. Methods based on audio-video fusion are therefore expected to become the best solution for fall detection in the foreseeable future. The video-based model uses YOLOv7-Pose to extract key skeleton joints, which are fed into a two-stream Spatial Temporal Graph Convolutional Network (ST-GCN) for classification. The audio-based model uses log-scaled mel spectrograms as features, which are processed by the MobileNetV2 architecture for detection. The two results are then fused at the decision level through linear weighting and Dempster-Shafer (D-S) theory. In our evaluation, the multimodal method significantly outperforms the single-modality methods: sensitivity rises from 81.67% with video alone to 96.67% (linear weighting) and 97.50% (D-S theory). This underscores the effectiveness of integrating video and audio data to achieve more robust and reliable fall detection in complex and diverse daily-life environments.
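The two decision-fusion schemes named in the abstract can be illustrated with a minimal sketch. The abstract does not give the weights, the mass assignment, or the decision threshold, so everything below is an assumption for illustration: each model is taken to output a fall probability, the linear weight `w_video` is arbitrary, and the D-S masses are built with a hypothetical per-modality `reliability` factor (classical Shafer discounting) over the frame {fall, no fall}. It is not the authors' implementation.

```python
from typing import NamedTuple


class Mass(NamedTuple):
    """Basic probability assignment over the frame {fall, no_fall}."""
    fall: float
    no_fall: float
    unknown: float  # mass left on the whole frame (ignorance)


def linear_fusion(p_video: float, p_audio: float, w_video: float = 0.5) -> float:
    """Weighted average of the two fall probabilities (w_video is an assumed weight)."""
    return w_video * p_video + (1.0 - w_video) * p_audio


def to_mass(p_fall: float, reliability: float) -> Mass:
    """Discount a classifier probability by an assumed reliability in [0, 1]."""
    return Mass(
        fall=reliability * p_fall,
        no_fall=reliability * (1.0 - p_fall),
        unknown=1.0 - reliability,
    )


def dempster_combine(m1: Mass, m2: Mass) -> Mass:
    """Dempster's rule of combination for two sources on a two-class frame."""
    # Conflict: one source supports 'fall' while the other supports 'no_fall'.
    conflict = m1.fall * m2.no_fall + m1.no_fall * m2.fall
    norm = 1.0 - conflict
    if norm <= 1e-12:
        raise ValueError("sources are in total conflict; rule is undefined")
    fall = (m1.fall * m2.fall + m1.fall * m2.unknown + m1.unknown * m2.fall) / norm
    no_fall = (m1.no_fall * m2.no_fall + m1.no_fall * m2.unknown
               + m1.unknown * m2.no_fall) / norm
    unknown = (m1.unknown * m2.unknown) / norm
    return Mass(fall, no_fall, unknown)


if __name__ == "__main__":
    # Hypothetical scores: video is unsure (e.g. poor lighting), audio is confident.
    p_video, p_audio = 0.3, 0.9
    print("linear:", linear_fusion(p_video, p_audio, w_video=0.5))
    fused = dempster_combine(to_mass(p_video, 0.8), to_mass(p_audio, 0.9))
    print("D-S:", fused, "-> fall" if fused.fall > fused.no_fall else "-> no fall")
```

In this toy case the video score alone would miss the fall, while both fused scores favour "fall", which matches the kind of false-negative reduction the abstract describes. How the reliabilities or weights should be chosen in practice is left open here.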

Journal: Heliyon
Publication Date: Apr 1, 2024
License type: CC BY
