A coupled HMM approach to video-realistic speech animation

Lei Xie,Zhi-Qiang Liu

doi:10.1016/j.patcog.2006.12.001

Abstract

We propose a coupled hidden Markov model (CHMM) approach to video-realistic speech animation, which realizes realistic facial animations driven by speaker independent continuous speech. Different from hidden Markov model (HMM)-based animation approaches that use a single-state chain, we use CHMMs to explicitly model the subtle characteristics of audio–visual speech, e.g., the asynchrony, temporal dependency (synchrony), and different speech classes between the two modalities. We derive an expectation maximization (EM)-based A/V conversion algorithm for the CHMMs, which converts acoustic speech into decent facial animation parameters. We also present a video-realistic speech animation system. The system transforms the facial animation parameters to a mouth animation sequence, refines the animation with a performance refinement process, and finally stitches the animated mouth with a background facial sequence seamlessly. We have compared the animation performance of the CHMM with the HMMs, the multi-stream HMMs and the factorial HMMs both objectively and subjectively. Results show that the CHMMs achieve superior animation performance. The ph- vi-CHMM system, which adopts different state variables (phoneme states and viseme states) in the audio and visual modalities, performs the best. The proposed approach indicates that explicitly modelling audio–visual speech is promising for speech animation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A coupled HMM approach to video-realistic speech animation

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition

Lead the way for us

Journal: Pattern Recognition	Publication Date: Jan 18, 2007
Citations: 109

Similar Papers

Improving acoustic event detection using generalizable visual features and multi-modality modeling
Po-Sen Huang ... Mark Hasegawa-Johnson
-
Po-Sen Huang, et. al.Po-Sen Huang ... Mark Hasegawa-Johnson
01 May 2011
01 May 2011

Speech Animation Using Coupled Hidden Markov Models
Lei Xie ... Zhi-Qiang Liu
-
Lei Xie, et. al. Lei Xie ... Zhi-Qiang Liu
01 Jan 2006
01 Jan 2006

The Learning Algorithms of Coupled Discrete Hidden Markov Models
Shi Ping Du ... Jian Wang
Applied Mechanics and Materials | VOL. 411-414
Shi Ping Du, et. al.Shi Ping Du ... Jian Wang
01 Sep 2013
Applied Mechanics and Materials | VOL. 411-414

Comparison of MPEG-4 facial animation parameter groups with respect to audio-visual speech recognition performance
P.S Aleksic ... K Katsaggelos
-
P.S Aleksic, et. al.P.S Aleksic ... K Katsaggelos
01 Jan 2004
01 Jan 2004

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A coupled HMM approach to video-realistic speech animation

Abstract

Talk to us

Similar Papers

More From: Pattern Recognition