Facial Animation Synthesis Research Articles

This paper presents a photo realistic facial animation synthesis approach based on an audio visual articulatory dynamic Bayesian network model (AF_AVDBN), in which the maximum asynchronies between the articulatory features, such as lips, tongue and glottis/velum, can be controlled. Perceptual Linear Prediction (PLP) features from audio speech, as well as active appearance model (AAM) features from face images of an audio visual continuous speech database, are adopted to train the AF_AVDBN model parameters. Based on the trained model, given an input audio speech, the optimal AAM visual features are estimated via a maximum likelihood estimation (MLE) criterion, which are then used to construct face images for the animation. In our experiments, facial animations are synthesized for 20 continuous audio speech sentences, using the proposed AF_AVDBN model, as well as the state-of-art methods, being the audio visual state synchronous DBN model (SS_DBN) implementing a multi-stream Hidden Markov Model, and the state asynchronous DBN model (SA_DBN). Objective evaluations on the learned AAM features show that much more accurate visual features can be learned from the AF_AVDBN model. Subjective evaluations show that the synthesized facial animations using AF_AVDBN are better than those using the state based SA_DBN and SS_DBN models, in the overall naturalness and matching accuracy of the mouth movements to the speech content.

Synthesizing expressive facial animation is a very challenging topic within the graphics community. In this paper, we present an expressive facial animation synthesis system enabled by automated learning from facial motion capture data. Accurate 3D motions of the markers on the face of a human subject are captured while he/she recites a predesigned corpus, with specific spoken and visual expressions. We present a novel motion capture mining technique that "learns" speech coarticulation models for diphones and triphones from the recorded data. A Phoneme-Independent Expression Eigenspace (PIEES) that encloses the dynamic expression signals is constructed by motion signal processing (phoneme-based time-warping and subtraction) and Principal Component Analysis (PCA) reduction. New expressive facial animations are synthesized as follows: First, the learned coarticulation models are concatenated to synthesize neutral visual speech according to novel speech input, then a texture-synthesis-based approach is used to generate a novel dynamic expression signal from the PIEES model, and finally the synthesized expression signal is blended with the synthesized neutral visual speech to create the final expressive facial animation. Our experiments demonstrate that the system can effectively synthesize realistic expressive facial animation.

Facial Animation Synthesis Research Articles

Related Topics

Articles published on Facial Animation Synthesis

Disentangling audio content and emotion with adaptive instance normalization for expressive facial animation synthesis

FLNet: Landmark Driven Fetching and Learning Network for Faithful Talking Facial Animation Synthesis

A State-of-the-Art Review on Image Synthesis With Generative Adversarial Networks

Research Paper on Expressive Facial Animation Synthesis by Learning Speech Co-articulation and Expression Spaces

Realistic emotion visualization by combining facial animation and hairstyle synthesis

Speech driven photo realistic facial animation based on an articulatory DBN model and AAM features

Position‐based facial animation synthesis

Evaluation of an image-based talking head with realistic facial expression and head motion

Parametric Facial Expression Synthesis and Animation

Expressive Speech Animation Synthesis with Phoneme‐Level Controls

Expressive Facial Animation Synthesis by Learning Speech Coarticulation and Expression Spaces

Emotional face expression profiles supported by virtual human ontology

Emotional face expression profiles supported by virtual human ontology

Surface detail capturing for realistic facial animation

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Facial Animation Synthesis Research Articles

Related Topics

Articles published on Facial Animation Synthesis

Disentangling audio content and emotion with adaptive instance normalization for expressive facial animation synthesis

FLNet: Landmark Driven Fetching and Learning Network for Faithful Talking Facial Animation Synthesis

A State-of-the-Art Review on Image Synthesis With Generative Adversarial Networks

Research Paper on Expressive Facial Animation Synthesis by Learning Speech Co-articulation and Expression Spaces

Realistic emotion visualization by combining facial animation and hairstyle synthesis

Speech driven photo realistic facial animation based on an articulatory DBN model and AAM features

Position‐based facial animation synthesis

Evaluation of an image-based talking head with realistic facial expression and head motion

Parametric Facial Expression Synthesis and Animation

Expressive Speech Animation Synthesis with Phoneme‐Level Controls

Expressive Facial Animation Synthesis by Learning Speech Coarticulation and Expression Spaces

Emotional face expression profiles supported by virtual human ontology

Emotional face expression profiles supported by virtual human ontology

Surface detail capturing for realistic facial animation