Impact of face swapping and data augmentation on sign language recognition

Marina Perea-Trigo, J J Vegas-Olmos, Luis M Soria-Morillo, Juan A Álvarez-García, Enrique J López-Ortiz

doi:10.1007/s10209-024-01133-y

Abstract

This study addresses the challenge of improving communication between deaf and hearing communities by exploring different sign language recognition (SLR) techniques. Privacy issues and the need for validation by interpreters make large-scale sign language (SL) datasets difficult to create. The authors address this by presenting a new dataset for isolated Spanish sign language recognition, CALSE-1000, consisting of 5000 videos representing 1000 glosses, recorded with various signers and scenarios. The study also proposes using computer vision techniques, such as face swapping and affine transformations, to augment the SL dataset and improve the accuracy of the I3D model trained on it. The results show that including these augmentations during training improves top-1 accuracy by up to 11.7 points, top-5 accuracy by up to 8.8 points, and top-10 accuracy by up to 9 points. According to the authors, the approach has great potential to improve the state of the art on other datasets and with other models. Furthermore, the analysis confirms the importance of facial expressions to the model by testing on a dataset with faces omitted, and shows how face swapping can be used to include new anonymous signers without the costly and time-consuming process of recording.
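The top-1, top-5 and top-10 figures quoted in the abstract are top-k accuracies: a prediction counts as correct when the true gloss is among the k highest-scoring classes. The following is a minimal sketch of that metric, not the authors' evaluation code. The function name `top_k_accuracy`, the toy score matrix, and the five-class setup are all hypothetical and chosen only for illustration; the function returns a fraction, whereas the abstract reports differences in percentage points.

```python
import numpy as np


def top_k_accuracy(scores, labels, k):
    """Fraction of samples whose true label is among the k highest-scoring classes.

    scores: array of shape (n_samples, n_classes) with one score per class.
    labels: array of shape (n_samples,) with the true class index per sample.
    """
    scores = np.asarray(scores)
    labels = np.asarray(labels)
    # Column indices of the k largest scores in each row (unordered within the top k).
    top_k = np.argpartition(scores, -k, axis=1)[:, -k:]
    # A sample is a hit if its true label appears anywhere in its top-k set.
    hits = (top_k == labels[:, None]).any(axis=1)
    return float(hits.mean())


# Hypothetical toy data: 4 clips scored against 5 gloss classes.
scores = np.array([
    [0.10, 0.60, 0.15, 0.10, 0.05],  # true class 1 is ranked 1st
    [0.50, 0.30, 0.10, 0.06, 0.04],  # true class 1 is ranked 2nd
    [0.40, 0.30, 0.20, 0.06, 0.04],  # true class 2 is ranked 3rd
    [0.40, 0.30, 0.20, 0.06, 0.04],  # true class 4 is ranked 5th
])
labels = np.array([1, 1, 2, 4])

for k in (1, 2, 3, 5):
    print(f"top-{k} accuracy: {top_k_accuracy(scores, labels, k):.2f}")
```

Comparing this value for a model trained with and without augmentation, on the same test set, gives the kind of difference the abstract reports.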

Journal: Universal Access in the Information Society
Publication Date: Jul 24, 2024
License type: CC BY 4.0

Similar Papers

American Sign Language Recognition Using Leap Motion Controller with Machine Learning Approach.
Teak-Wei Chong ... Boon-Giin Lee
19 Oct 2018
Sensors | VOL. 18

Cross-modal knowledge distillation for continuous sign language recognition
Liqing Gao ... Wei Feng
30 Jul 2024
Neural Networks | VOL. 179

Sign Language Recognition: High Performance Deep Learning Approach Applyied To Multiple Sign Languages
Abdellah El Zaar ... S Bennani Dosse
01 Jan 2021
E3S Web of Conferences | VOL. 351

A survey on recent advances in Sign Language Production
Razieh Rastgoo ... Mohammad Sabokrou
09 Dec 2023
Expert Systems with Applications | VOL. 243
