Audio-driven talking face generation with diverse yet realistic facial animations

Rongliang Wu, Yingchen Yu, Fangneng Zhan, Jiahui Zhang, Xiaoqin Zhang, Shijian Lu

doi:10.1016/j.patcog.2023.109865

Abstract

Audio-driven talking face generation, which aims to synthesize talking faces with realistic facial animations (including accurate lip movements, vivid facial expression details, and natural head poses) corresponding to the audio, has achieved rapid progress in recent years. However, most existing work focuses on generating lip movements only, without handling the closely correlated facial expressions, which greatly degrades the realism of the generated faces. This paper presents DIRFA, a novel method that can generate talking faces with diverse yet realistic facial animations from the same driving audio. To accommodate a fair variation of plausible facial animations for the same audio, we design a transformer-based probabilistic mapping network that models the variational facial animation distribution conditioned on the input audio and autoregressively converts the audio signals into a facial animation sequence. In addition, we introduce a temporally-biased mask into the mapping network, which allows it to model the temporal dependency of facial animations and produce temporally smooth facial animation sequences. With the generated facial animation sequence and a source image, photo-realistic talking faces can be synthesized with a generic generation network. Extensive experiments show that DIRFA can effectively generate talking faces with realistic facial animations.
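
The abstract does not say what form the temporally-biased mask takes. As a rough illustration only, the sketch below assumes one plausible form: a causal attention mask (each frame attends only to itself and earlier frames) combined with a linear penalty that grows with temporal distance, so nearby frames receive more attention weight. The function names, the `slope` parameter, and the linear penalty are all hypothetical and are not taken from DIRFA.

```python
import numpy as np


def temporally_biased_mask(num_frames: int, slope: float = 0.5) -> np.ndarray:
    """Additive attention mask (hypothetical form, not DIRFA's actual mask).

    Entry [i, j] is added to the attention score of query frame i on key frame j:
    -inf for future frames (j > i), and -slope * (i - j) for past frames,
    so more distant past frames are penalised more.
    """
    idx = np.arange(num_frames)
    distance = (idx[:, None] - idx[None, :]).astype(float)  # i - j
    mask = -slope * distance
    mask[distance < 0] = -np.inf  # block attention to future frames
    return mask


def masked_attention_weights(scores: np.ndarray, mask: np.ndarray) -> np.ndarray:
    """Row-wise softmax over scores + mask (numerically stabilised)."""
    biased = scores + mask
    biased = biased - biased.max(axis=-1, keepdims=True)
    weights = np.exp(biased)  # exp(-inf) evaluates to 0 for masked entries
    return weights / weights.sum(axis=-1, keepdims=True)


if __name__ == "__main__":
    mask = temporally_biased_mask(4)
    uniform_scores = np.zeros((4, 4))
    print(masked_attention_weights(uniform_scores, mask).round(3))
```

With uniform scores, each row of the resulting weights puts the most mass on the current frame and progressively less on older frames, which is one simple way a mask could encourage temporally smooth output in an autoregressive model.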

Journal: Pattern Recognition
Publication Date: Aug 4, 2023
Citations: 4

Similar Papers

Realistic 2D Facial Animation from One Image
Jaehwan Kim ... Il-Kwon Jeong
01 Jan 2010

Exploring Non‐Linear Relationship of Blendshape Facial Animation
Xuecheng Liu ... Zhaoqi Wang
Computer Graphics Forum | VOL. 30
23 Mar 2011

Realistic Facial Animation by Automatic Individual Head Modeling and Facial Muscle Adjustment
Akinobu Maejima ... Shigeo Morishima
01 Jan 2010

Realistic 3D facial animations with subtle texture changes
D.L Jiang ... W Gao
15 Dec 2003
