Gesture Video Research Articles

Implementation of automatic sign language translation software in the process of social inclusion of people with hearing impairment is an important task. Social inclusion for people with hearing disabilities is an acute problem that must be solved in the context of the development of IT technologies and legislative initiatives that ensure the rights of people with disabilities and their equal opportunities. This substantiates the relevance of the research of assistive technologies, in the context of software tools, such as the process of social inclusion of people with severe hearing impairment in society. The subject of research is methods of automated sign language translation using intelligent technologies. The purpose of the work is the development and research of sign language automation methods to improve the quality of life of people with hearing impairments in accordance with the "Goals of Sustainable Development of Ukraine" (in the "Reduction of Inequality" part). The main tasks of the research are the development and testing of methods of converting sign language into text, converting text into sign language, as well as automating translation from one sign language to another sign language using modern intelligent technologies. Neural network modeling and 3D animation methods were used to solve these problems. The following results were obtained in the work: the main problems and tasks of social inclusion for people with hearing impairments were identified; a comparative analysis of modern methods and software platforms of automatic sign language translation was carried out; a system combining the SL-to-Text method is proposed and investigated; the Text-to-SL method using 3D animation to generate sign language concepts; the method of generating a 3D-animated gesture from video recordings; method of implementing the Sign Language1 to Sign Language2 technology. For gesture recognition, a convolutional neural network model is used, which is trained using imported and system-generated datasets of video gestures. The trained model has a high recognition accuracy (98.52%). The creation of a 3D model for displaying the gesture on the screen and its processing took place in the Unity 3D environment. The structure of the project, executive and auxiliary files used to build 3D animation for the generation of sign language concepts includes: event handler files; display results according to which they carry information about the position of the tracked points of the body; files that store the characteristics of materials that have been added to certain body mapping points. Conclusions: the proposed methods of automated translation have practical significance, which is confirmed by the demo versions of the software applications "Sign Language to Text" and "Text to Sign Language". A promising direction for continuing research on the topic of the work is the improvement of SL1-to-SL2 methods, the creation of open datasets of video gestures, the joining of scientists and developers to fill dictionaries with concepts of various sign languages.

Read full abstract

Sign language is a visual way of communicating used by people who are deaf or hard of hearing. It involves handshapes, facial expressions, and body movements to convey meaning. Sign language helps the deaf community interact with each other and the hearing world, allowing them to participate fully in society. According to the WHO (World Health Organization) over 5 % of the world’s population – or 430 million people — experience problems with hearing. More than 44,000 people with hearing impairments are registered with the Ukrainian Society of the Deaf, an all-Ukrainian public organization for the disabled. Therefore, it is extremely important to develop new software, available to the public, that would allow quickly and effectively learn and understand sign language. This work aims to review gesture recognition techniques and develop a system for detecting and classifying gestures of the Ukrainian dactylic alphabet. Two main approaches to gesture recognition, glove-based and computer vision-based (CV), are explained, with the latter being preferred due to its flexibility and widespread usage. The text elaborates on deep learning-based approaches, particularly LSTM networks, and the advantages they offer in automatically learning features from raw image data. The process of creating a dataset for training the gesture classification model is described, which involves recording videos of hand gestures and extracting keypoints using Google MediaPipe. The model training phase is detailed, covering the architecture of the LSTM-based classifier, optimization algorithms, and loss functions. The resulting model achieves an accuracy of 98.4% on the test dataset. A program for real-time gesture recognition is developed using Python and relevant libraries. The program utilizes a webcam feed to detect and classify hand gestures, displaying the top three predicted letters of the Ukrainian dactylic alphabet. The scientific novelty of the obtained results: the paper presents a method that utilizes hand keypoints for recognizing hand gestures of the Ukrainian dactyl alphabet. Also, as part of the development of the gesture recognition system, a data set was collected, where each gesture corresponds to 50 videos of 65 frames. The practical significance of the results obtained: the model obtained as a result of the study can be used to interpret the gestures of the Ukrainian dactylic alphabet. The dataset collected for training this model can be used in other works to train or validate similar models. The paper might be of use to the ones who are interested in developing similar systems for gesture recognition.

Read full abstract

Gesture Video Research Articles

Related Topics

Articles published on Gesture Video

An Embedded Neural Network Approach for Reinforcing Deep Learning: Advancing Hand Gesture Recognition

Gloss-driven Conditional Diffusion Models for Sign Language Production

The Communicative Assistance System for Deaf and Dum

Body Perception and Social Touch Preferences in Times of Grief

Deep learning models beyond temporal frame-wise features for hand gesture video recognition

Automatic sign language translation system using neural network technologies and 3D animation

Dynamic Hand Gesture Recognition for Indian Sign Language using Integrated CNN-LSTM Architecture

Interpretations of meaningful and ambiguous hand gestures in autistic and non-autistic adults: A norming study.

Gesture retrieval and its application to the study of multimodal communication

Recognizing gestures of the ukrainian dactylic alphabet

Vision-Based Hand Detection and Tracking Using Fusion of Kernelized Correlation Filter and Single-Shot Detection

An Approach for Gesture Recognition Based on a Lightweight Convolutional Neural Network

COVID-19 vaccine hesitancy and shaming on TikTok: A multimodal appraisal analysis

Two-stream fusion model using 3D-CNN and 2D-CNN via video-frames and optical flow motion templates for hand gesture recognition.

Video Hand Gestures Recognition Using Depth Camera and Lightweight CNN

Automatic Quality Assessment of Speech-Driven Synthesized Gestures

A weighted sparse coding model on product Grassmann manifold for video-based human gesture recognition.

Content-Adaptive and Attention-Based Network for Hand Gesture Recognition

Development of an End-to-End Deep Learning Framework for Sign Language Recognition, Translation, and Video Generation

Autonomic system tuning during gesture observation and reproduction

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Gesture Video Research Articles

Related Topics

Articles published on Gesture Video

An Embedded Neural Network Approach for Reinforcing Deep Learning: Advancing Hand Gesture Recognition

Gloss-driven Conditional Diffusion Models for Sign Language Production

The Communicative Assistance System for Deaf and Dum

Body Perception and Social Touch Preferences in Times of Grief

Deep learning models beyond temporal frame-wise features for hand gesture video recognition

Automatic sign language translation system using neural network technologies and 3D animation

Dynamic Hand Gesture Recognition for Indian Sign Language using Integrated CNN-LSTM Architecture

Interpretations of meaningful and ambiguous hand gestures in autistic and non-autistic adults: A norming study.

Gesture retrieval and its application to the study of multimodal communication

Recognizing gestures of the ukrainian dactylic alphabet

Vision-Based Hand Detection and Tracking Using Fusion of Kernelized Correlation Filter and Single-Shot Detection

An Approach for Gesture Recognition Based on a Lightweight Convolutional Neural Network

COVID-19 vaccine hesitancy and shaming on TikTok: A multimodal appraisal analysis

Two-stream fusion model using 3D-CNN and 2D-CNN via video-frames and optical flow motion templates for hand gesture recognition.

Video Hand Gestures Recognition Using Depth Camera and Lightweight CNN

Automatic Quality Assessment of Speech-Driven Synthesized Gestures

A weighted sparse coding model on product Grassmann manifold for video-based human gesture recognition.

Content-Adaptive and Attention-Based Network for Hand Gesture Recognition

Development of an End-to-End Deep Learning Framework for Sign Language Recognition, Translation, and Video Generation

Autonomic system tuning during gesture observation and reproduction