DFCNet +: Cross-modal dynamic feature contrast net for continuous sign language recognition

Yuan Feng,Nuoyi Chen,Yumeng Wu,Caoyu Jiang,Sheng Liu,Shengyong Chen

doi:10.1016/j.imavis.2024.105260

Abstract

In sign language communication, the combination of hand signs and facial expressions is used to convey messages in a fluid manner. Accurate interpretation relies heavily on understanding the context of these signs. Current methods, however, often focus on static images, missing the continuous flow and the story that unfolds through successive movements in sign language. To address this constraint, our research introduces the Dynamic Feature Contrast Net Plus (DFCNet+), a novel model that incorporates both dynamic feature extraction and cross-modal learning. The dynamic feature extraction module of DFCNet+ uses dynamic trajectory capture to monitor and record motion across frames and apply key features as an enhancement tool that highlights pixels that are critical for recognizing important sign language movements, allowing the model to follow the temporal variation of the signs. In the cross-modal learning module, we depart from the conventional approach of aligning video frames with textual descriptions. Instead, we adopt a gloss-level alignment, which provides a more detailed match between the visual signals and their corresponding text glosses, capturing the intricate relationship between what is seen and the associated text. The enhanced proficiency of DFCNet+ in discerning inter-frame details translates to heightened precision on benchmarks such as PHOENIX14, PHOENIX14-T and CSL-Daily. Such performance underscores its advantage in dynamic feature capture and inter-modal learning compared to conventional approaches to sign language interpretation. Our code is available at https://github.com/fyzjut/DFCNet_Plus.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

DFCNet +: Cross-modal dynamic feature contrast net for continuous sign language recognition

Abstract

Talk to us

Similar Papers

More From: Image and Vision Computing

Lead the way for us

Similar Papers

Deep transfer learning base on sequenced edge grid image technique for sign language recognition
Supathep Satiman ... Phayung Meesad
International journal of health sciences | VOL. -
Supathep Satiman, et. al.Supathep Satiman ... Phayung Meesad
24 Aug 2022
International journal of health sciences | VOL. -

Perspectives on the Sign Language Factor in Sub-Saharan Africa: Challenges of Sustainability.
Sam Lutalo-Kiingi ... Goedele A M De Clerck
American annals of the deaf | VOL. 162
Sam Lutalo-Kiingi, et. al.Sam Lutalo-Kiingi ... Goedele A M De Clerck
01 Jan 2017
American annals of the deaf | VOL. 162

Deaf education in Croatia
Iva Rinčić ... Amir Muzur
Croatian Medical Journal | VOL. 54
Iva Rinčić, et. al.Iva Rinčić ... Amir Muzur
01 Feb 2013
Croatian Medical Journal | VOL. 54

Dynamic Facial Expression Feature Extraction and Classification Based on Candide-3 Face Model
Dong Li ... Xinzhu Wang
-
Dong Li, et. al.Dong Li ... Xinzhu Wang
01 Sep 2014
01 Sep 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DFCNet +: Cross-modal dynamic feature contrast net for continuous sign language recognition

Abstract

Talk to us

Similar Papers

More From: Image and Vision Computing