Abstract

In this paper, a deep learning approach based on the Restricted Boltzmann Machine (RBM) is used to perform automatic hand sign language recognition from visual data. We evaluate how the RBM, as a deep generative model, can model the distribution of the input data for improved recognition of unseen data. Two modalities, RGB and Depth, are fed to the model in three forms: the original image, a cropped image, and a noisy cropped image. Five crops of the input image are taken, and the hand in each crop is detected using a Convolutional Neural Network (CNN). Three types of detected hand image are then generated for each modality and fed to the RBMs. The outputs of the RBMs for the two modalities are fused in another RBM in order to recognize the sign label of the input image. The proposed multi-modal model is trained on all or part of the American alphabet and digits of four publicly available datasets, and we also evaluate its robustness to noise. Experimental results show that the proposed multi-modal model, using crops and the RBM fusion methodology, achieves state-of-the-art results on the Massey University Gesture Dataset 2012, the American Sign Language (ASL) Fingerspelling Dataset from the University of Surrey's Center for Vision, Speech and Signal Processing, the NYU, and the ASL Fingerspelling A datasets.
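The two-stream design described above (one RBM per modality, with their hidden codes fused by a third RBM) can be sketched as follows. This is a minimal NumPy illustration assuming binary RBMs trained with one step of contrastive divergence (CD-1); the image resolution, layer sizes, and learning rate are placeholders, not the authors' settings.

```python
# Illustrative sketch of the RBM fusion pipeline (assumed sizes, not the paper's code).
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class RBM:
    """Binary-binary Restricted Boltzmann Machine trained with CD-1."""
    def __init__(self, n_visible, n_hidden, lr=0.01):
        self.W = rng.normal(0, 0.01, size=(n_visible, n_hidden))
        self.b_v = np.zeros(n_visible)  # visible biases
        self.b_h = np.zeros(n_hidden)   # hidden biases
        self.lr = lr

    def hidden_probs(self, v):
        return sigmoid(v @ self.W + self.b_h)

    def visible_probs(self, h):
        return sigmoid(h @ self.W.T + self.b_v)

    def cd1_step(self, v0):
        """One contrastive-divergence (CD-1) update on a batch v0."""
        ph0 = self.hidden_probs(v0)
        h0 = (rng.random(ph0.shape) < ph0).astype(float)  # sample hidden states
        v1 = self.visible_probs(h0)                        # mean-field reconstruction
        ph1 = self.hidden_probs(v1)
        batch = v0.shape[0]
        self.W   += self.lr * (v0.T @ ph0 - v1.T @ ph1) / batch
        self.b_v += self.lr * (v0 - v1).mean(axis=0)
        self.b_h += self.lr * (ph0 - ph1).mean(axis=0)

# One RBM per modality; a third RBM fuses their hidden activations,
# mirroring the two-stream fusion described in the abstract.
rgb_rbm    = RBM(n_visible=64 * 64, n_hidden=500)
depth_rbm  = RBM(n_visible=64 * 64, n_hidden=500)
fusion_rbm = RBM(n_visible=1000, n_hidden=500)

# Toy batch of flattened 64x64 hand crops (placeholders for real data).
rgb_batch   = rng.random((8, 64 * 64))
depth_batch = rng.random((8, 64 * 64))

rgb_rbm.cd1_step(rgb_batch)
depth_rbm.cd1_step(depth_batch)
fused = np.concatenate([rgb_rbm.hidden_probs(rgb_batch),
                        depth_rbm.hidden_probs(depth_batch)], axis=1)
fusion_rbm.cd1_step(fused)  # the fused code would then feed a sign classifier
```

In the paper the fused representation is used to predict the sign label; here the classifier stage is omitted, since the abstract does not specify its form.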

Highlights

  • Deaf people face many barriers when communicating with others in society

  • We propose a deep model based on the Restricted Boltzmann Machine (RBM) to improve sign language recognition accuracy from two input modalities, RGB and Depth

  • We discuss detailed results of the proposed method on four public datasets


Introduction

Deaf people face many barriers when communicating with others in society. Owing to impairments in hearing and speech, profoundly deaf people cannot communicate normally with others, so a dedicated sign language is essential for them to do so [1]. Several projects and studies have been proposed to create or improve smart systems that recognize and detect sign language from hand and face gestures in visual data. While each method provides different properties, more research is required to obtain a complete and accurate model for sign language recognition. In recent years, deep learning approaches have become common for improving the recognition accuracy of sign language models. We use a generative deep model, the Restricted Boltzmann Machine (RBM), for this purpose.
