Multi-Modal Fusion Sign Language Recognition Based on Residual Network and Attention Mechanism

Chaoqin Chu,Yinhuan Zhang,Qinkun Xiao,Xing Liu

doi:10.1142/s0218001422500367

Abstract

Sign language recognition (SLR) is a useful tool for the deaf-mute to communicate with the outside world. Although many SLR methods have been proposed and have demonstrated good performance, continuous SLR (CSLR) is still challenging. Meanwhile, due to the heavy occlusions and closely interacting motions, there is a higher requirement for the real-time efficiency of CSLR. Therefore, the performance of CSLR needs further improvement. The highlights include: (1) to overcome these challenges, this paper proposes a novel video-based CSLR framework. This framework consists of three components: an OpenPose-based skeleton stream extraction module, a RGB stream extraction module, and a combination module of the BiLSTM network and the conditional hidden Markov model (CHMM) for CSLR. (2) A new residual network with Squeeze-and-Excitation blocks (SEResNet50) for video sequence feature extraction. (3) This paper combines the SEResNet50 module with the BiLSTM network to extract the feature information from video streams with different modalities. To evaluate the effectiveness of our proposed framework, experiments are conducted on two CSL datasets. The experimental results indicate that our method is superior to the methods in the literature.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Multi-Modal Fusion Sign Language Recognition Based on Residual Network and Attention Mechanism

Abstract

Talk to us

Similar Papers

More From: International Journal of Pattern Recognition and Artificial Intelligence

Lead the way for us

Journal: International Journal of Pattern Recognition and Artificial Intelligence	Publication Date: Sep 3, 2022
Citations: 3

Similar Papers

Multi-Information Spatial–Temporal LSTM Fusion Continuous Sign Language Neural Machine Translation
Qinkun Xiao ... Xue Zhang
IEEE access : practical innovations, open solutions | VOL. 8
Qinkun Xiao, et. al.Qinkun Xiao ... Xue Zhang
01 Jan 2020
IEEE access : practical innovations, open solutions | VOL. 8

A Deep Neural Framework for Continuous Sign Language Recognition by Iterative Training
Runpeng Cui ... Hu Liu
IEEE Transactions on Multimedia | VOL. 21
Runpeng Cui, et. al.Runpeng Cui ... Hu Liu
01 Jul 2019
IEEE Transactions on Multimedia | VOL. 21

EM-Sign: A Non-Contact Recognition Method Based on 24 GHz Doppler Radar for Continuous Signs and Dialogues
Linting Ye ... Shengchang Lan
Electronics | VOL. 9
Linting Ye, et. al.Linting Ye ... Shengchang Lan
26 Sep 2020
Electronics | VOL. 9

Fully Convolutional Networks for Continuous Sign Language Recognition
Ka Leong Cheng ... Yu-Wing Tai
-
Ka Leong Cheng, et. al.Ka Leong Cheng ... Yu-Wing Tai
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-Modal Fusion Sign Language Recognition Based on Residual Network and Attention Mechanism

Abstract

Talk to us

Similar Papers

More From: International Journal of Pattern Recognition and Artificial Intelligence