Dynamic Hand Gesture Recognition Using 3D-CNN and LSTM Networks

Muneeb Ur Rehman,Usman Tariq,Muhammad Attique Khan,Fawad Ahmed,Nouf M Alzahrani,Jawad Ahmad,Faisal Abdulaziz Alfouzan

doi:10.32604/cmc.2022.019586

Abstract

Recognition of dynamic hand gestures in real-time is a difficult task because the system can never know when or from where the gesture starts and ends in a video stream. Many researchers have been working on vision-based gesture recognition due to its various applications. This paper proposes a deep learning architecture based on the combination of a 3D Convolutional Neural Network (3D-CNN) and a Long Short-Term Memory (LSTM) network. The proposed architecture extracts spatial-temporal information from video sequences input while avoiding extensive computation. The 3D-CNN is used for the extraction of spectral and spatial features which are then given to the LSTM network through which classification is carried out. The proposed model is a light-weight architecture with only 3.7 million training parameters. The model has been evaluated on 15 classes from the 20BN-jester dataset available publicly. The model was trained on 2000 video-clips per class which were separated into 80% training and 20% validation sets. An accuracy of 99% and 97% was achieved on training and testing data, respectively. We further show that the combination of 3D-CNN with LSTM gives superior results as compared to MobileNetv2 + LSTM.

Highlights

Gestures are primary tool of symbolic communication and natural form in which humans express themselves more effectively
Tab. 2 shows a comparison of the proposed 3D Convolutional Neural Network (3D-CNN) + Long Short-Term Memory (LSTM) model with other models in terms of accuracy, precision and recall using the 20BN-jester dataset for 15 classes
L2 batch normalization was introduced to MobilNetV2+LSTM model and the accuracy improved to 87%, which was better but not acceptable as compared to other techniques proposed in the literature

Summary

Introduction

Gestures are primary tool of symbolic communication and natural form in which humans express themselves more effectively. They vary from simple to more complex actions which allow us to communicate with others. CMC, 2022, vol., no.3 most flexible body part of a human body is hand, hand gestures can express rich and various form of communication between humans and machines. They are widely used for communication between humans and computers or other electronic devices such as smart phones, robotics, auto-mobile infotainment system, etc. Gesture recognition can replace human-computer interaction from touch or wired-controlled input devices [1]

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Computers, Materials & Continua	Publication Date: Jan 1, 2022
Citations: 25	License type: cc-by

R Discovery Prime

R Discovery Prime

Dynamic Hand Gesture Recognition Using 3D-CNN and LSTM Networks

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Computers, Materials & Continua

Lead the way for us

Similar Papers

Fundamentals of Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) network
Alex Sherstinsky
Physica D: Nonlinear Phenomena | VOL. 404
Alex SherstinskyAlex Sherstinsky
21 Jan 2020
Physica D: Nonlinear Phenomena | VOL. 404

Research and Application of Deformation Prediction Model for Deep Foundation Pit Based on LSTM
Hailin Li ... Xue Du
Wireless Communications and Mobile Computing | VOL. 2022
Hailin Li, et. al.Hailin Li ... Xue Du
06 Jul 2022
Wireless Communications and Mobile Computing | VOL. 2022

LSTM Neural Network based Tensile Stress Prediction of Rubber Streching
Dazi Li ... Yue Fang
-
Dazi Li, et. al.Dazi Li ... Yue Fang
30 Oct 2020
30 Oct 2020

Estimation of SoH and internal resistances of Lithium ion battery based on LSTM network
Chi Nguyen Van ... Duy Ta Quang
International Journal of Electrochemical Science | VOL. 18
Chi Nguyen Van, et. al.Chi Nguyen Van ... Duy Ta Quang
20 Apr 2023
International Journal of Electrochemical Science | VOL. 18

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Dynamic Hand Gesture Recognition Using 3D-CNN and LSTM Networks

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Computers, Materials &amp; Continua

More From: Computers, Materials & Continua