Abstract
Gestural animations in the amusement and entertainment fields often require rich expression; however, synthesizing characteristic gestures automatically remains challenging. Although style transfer based on neural network models is a potential solution, existing methods mainly focus on cyclic motions such as gaits and require re-training whenever new motion styles are added. Moreover, their per-pose transformation cannot account for time-dependent features, so motion styles with differing periods and timings are difficult to transfer. This limitation is fatal for gestural motions, which require complicated time alignment due to the variety of exaggerated or intentionally performed behaviors.

This study introduces a context-based style transfer of gestural motions with neural networks to ensure stable conversion even for exaggerated, dynamically complicated gestures. We present a model based on a vision transformer that transfers the content and style features of gestures by time-segmenting them into tokens in a latent space. We extend this model to yield the probability of swapping gesture tokens for style transfer. A transformer model is well suited to semantically consistent matching among gesture tokens, owing to their correlation with spoken words. The compact architecture of our network model requires only a small number of parameters and little computational cost, making it suitable for real-time applications on an ordinary device.

We introduce loss functions given by the restoration error of identically and cyclically transferred gesture tokens, together with similarity losses for content and style evaluated by splicing features inside the transformer. This design of losses enables unsupervised and zero-shot learning, which provides scalability with respect to motion data.

We comparatively evaluated our style transfer method, focusing mainly on expressive gestures, using a dataset we captured for various scenarios and styles and introducing new error metrics tailored for gestures. Our experiments showed that our method surpasses existing methods in the numerical accuracy and stability of style transfer.
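To make the described pipeline more concrete, the following is a minimal sketch (not the authors' code) of the idea outlined above: gestures are time-segmented into tokens, a transformer encoder embeds content and style tokens in a latent space, tokens are mixed via swap probabilities, and identity/cycle restoration errors drive unsupervised training. All names, dimensions, and the exact swapping rule here are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GestureTokenTransfer(nn.Module):
    """Hypothetical token-based gesture style transfer, loosely following the abstract."""
    def __init__(self, pose_dim=69, seg_len=16, d_model=128, n_heads=4, n_layers=2):
        super().__init__()
        self.seg_len = seg_len
        # Embed each fixed-length gesture segment (one "token") into the latent space.
        self.tokenize = nn.Linear(pose_dim * seg_len, d_model)
        layer = nn.TransformerEncoderLayer(d_model, n_heads,
                                           dim_feedforward=256, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, n_layers)
        # Scores used to derive the probability of swapping content tokens for style tokens.
        self.swap_score = nn.Linear(d_model, d_model)
        self.detokenize = nn.Linear(d_model, pose_dim * seg_len)

    def to_tokens(self, motion):
        # motion: (batch, frames, pose_dim) -> (batch, n_tokens, seg_len * pose_dim)
        b, t, d = motion.shape
        n = t // self.seg_len
        return motion[:, : n * self.seg_len].reshape(b, n, self.seg_len * d)

    def forward(self, content_motion, style_motion):
        c = self.encoder(self.tokenize(self.to_tokens(content_motion)))
        s = self.encoder(self.tokenize(self.to_tokens(style_motion)))
        # Soft token swapping: attention-like probabilities over the style tokens.
        logits = self.swap_score(c) @ s.transpose(1, 2)   # (batch, n_content, n_style)
        p_swap = torch.softmax(logits, dim=-1)
        mixed = p_swap @ s                                # style-transferred tokens
        out = self.detokenize(mixed)
        b, n, _ = out.shape
        return out.reshape(b, n * self.seg_len, -1)

def training_losses(model, motion_a, motion_b):
    """Identity transfer should reproduce the input; a cycle A->B->A should restore A."""
    identity = model(motion_a, motion_a)
    a_to_b = model(motion_a, motion_b)
    cycled = model(a_to_b, motion_a)
    n_frames = identity.shape[1]
    l_identity = F.l1_loss(identity, motion_a[:, :n_frames])
    l_cycle = F.l1_loss(cycled, motion_a[:, :n_frames])
    return l_identity, l_cycle
```

The content/style similarity losses obtained by splicing features inside the transformer are omitted here, since their exact formulation is not given in the abstract; the sketch only illustrates the tokenization, swap-probability mixing, and identity/cycle restoration terms.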