Abstract

Sign language is a complex language that combines hand gestures, body movements, and facial expressions and is used primarily by the deaf community. Sign language recognition (SLR) is a popular research domain because it offers an efficient and reliable way to bridge the communication gap between people who are hard of hearing and hearing people. Recognizing isolated sign language words from video remains a challenging problem in computer vision. This paper proposes a hybrid SLR framework that combines a convolutional neural network (CNN) with an attention-based long short-term memory (LSTM) network. MobileNetV2 serves as the backbone because its lightweight structure reduces model complexity while extracting meaningful features from the video frame sequence. The spatial features are fed to an LSTM equipped with an attention mechanism that selects significant gesture cues and focuses on salient features in the sequential data. The proposed method is evaluated on the benchmark WLASL dataset with 100 classes using precision, recall, F1-score, and 5-fold cross-validation. Our method achieved an average accuracy of 84.65%. The experimental results show that our model is both effective and computationally efficient compared to other state-of-the-art methods.
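For illustration, the sketch below outlines the kind of pipeline the abstract describes: per-frame spatial features from a MobileNetV2 backbone, a temporal LSTM, and attention pooling over frames before classification. This is a minimal PyTorch sketch, not the authors' implementation; the hidden size, the additive attention form, and the class name AttentionLSTMSLR are assumptions made for clarity.

```python
# Illustrative sketch (not the authors' released code): MobileNetV2 frame
# encoder -> LSTM over time -> attention pooling -> 100-way classifier.
# Shapes, hidden size, and attention form are assumptions, not the paper's spec.
import torch
import torch.nn as nn
from torchvision.models import mobilenet_v2

class AttentionLSTMSLR(nn.Module):
    def __init__(self, num_classes=100, hidden_size=256):
        super().__init__()
        # In practice ImageNet-pretrained weights would typically be loaded;
        # weights=None keeps this sketch runnable offline.
        self.cnn = mobilenet_v2(weights=None).features   # spatial feature extractor
        self.pool = nn.AdaptiveAvgPool2d(1)              # 1280-d vector per frame
        self.lstm = nn.LSTM(1280, hidden_size, batch_first=True)
        self.attn = nn.Linear(hidden_size, 1)            # scores each time step
        self.fc = nn.Linear(hidden_size, num_classes)

    def forward(self, x):                                # x: (B, T, 3, H, W)
        B, T = x.shape[:2]
        feats = self.pool(self.cnn(x.flatten(0, 1))).flatten(1)  # (B*T, 1280)
        feats = feats.view(B, T, -1)                     # (B, T, 1280)
        h, _ = self.lstm(feats)                          # (B, T, hidden)
        weights = torch.softmax(self.attn(h), dim=1)     # attention over frames
        context = (weights * h).sum(dim=1)               # weighted temporal summary
        return self.fc(context)                          # class logits

# Example: a batch of 2 clips, 16 frames each, 224x224 RGB
logits = AttentionLSTMSLR()(torch.randn(2, 16, 3, 224, 224))
print(logits.shape)  # torch.Size([2, 100])
```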
