Jointly Harnessing Prior Structures and Temporal Consistency for Sign Language Video Generation

Yucheng Suo, Xiaohan Wang, Zhedong Zheng, Bang Zhang, Yi Yang

doi:10.1145/3648368

Abstract

Sign language provides a way for differently-abled individuals to express their feelings and emotions. However, learning sign language can be challenging and time-consuming. An alternative approach is to animate user photos using sign language videos of specific words, which can be achieved using existing image animation methods. However, the finger motions in the generated videos are often not ideal. To address this issue, we propose the Structure-aware Temporal Consistency Network (STCNet), which jointly optimizes the prior structure of humans with temporal consistency to produce sign language videos. We use a fine-grained skeleton detector to acquire knowledge of body structure and introduce both short- and long-term cycle losses to ensure the continuity of the generated video. The two losses and the keypoint detector network are optimized in an end-to-end manner. Quantitative and qualitative evaluations on three widely used datasets, namely LSA64, Phoenix-2014T, and WLASL-2000, demonstrate the effectiveness of the proposed method. It is our hope that this work can contribute to future studies on sign language production.

Journal: ACM Transactions on Multimedia Computing, Communications, and Applications
Publication Date: Mar 26, 2024
Citations: 4

Similar Papers

Sign Language Recognition and Video Generation Using Deep Learning
Meera Treesa Mathews ... Joyal Raphel
Journal of Applied Science, Engineering, Technology and Management | VOL. 1
02 Dec 2023

A survey on recent advances in Sign Language Production
Razieh Rastgoo ... Mohammad Sabokrou
Expert Systems with Applications | VOL. 243
09 Dec 2023

Identifying Sign Language Videos in Video Sharing Sites
Frank M Shipman ... Caio D D Monteiro
ACM Transactions on Accessible Computing | VOL. 5
01 Mar 2014

Changing the Representation: Examining Language Representation for Neural Sign Language Production
...
arXiv (Cornell University) | VOL. -
16 Sep 2022
