Learning Expressive Human-Like Head Motion Sequences from Speech

Carlos Busso,Zhigang Deng,Ulrich Neumann,Shrikanth Narayanan

doi:10.1007/978-1-84628-907-1_6

Abstract

With the development of new trends in human-machine interfaces, animated feature films and video games, better avatars and virtual agents are required that more accurately mimic how humans communicate and interact. Gestures and speech are jointly used to express intended messages. The tone and energy of the speech, facial expression, rigid head motion and hand motion combine in a non-trivial manner as they unfold in natural human interaction. Given that the use of large motion capture datasets is expensive and can only be applied in planned scenarios, new automatic approaches are required to synthesize realistic animation that capture and resemble the complex relationship between these communicative channels. One useful and practical approach is the use of acoustic features to generate gestures, exploiting the link between gestures and speech. Since the shape of the lips is determined by the underlying articulation, acoustic features have been used to generate visual visemes that match the spoken sentences [4, 5, 12, 17]. Likewise, acoustic features have been used to synthesize facial expressions [11, 30], exploiting the fact that the same muscles used for articulation also affect the shape of the face [44, 46]. One important gesture that has received less attention than other aspects in facial animations is rigid head motion. Head motion is important not only to acknowledge active listening or replace verbal information (e.g. “nod”), but also for many aspect of human

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learning Expressive Human-Like Head Motion Sequences from Speech

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Rigid Head Motion in Expressive Speech Animation: Analysis and Synthesis
Carlos Busso ... Ulrich Neumann
IEEE Transactions on Audio, Speech and Language Processing | VOL. 15
Carlos Busso, et. al.Carlos Busso ... Ulrich Neumann
01 Mar 2007
IEEE Transactions on Audio, Speech and Language Processing | VOL. 15

Enriching Facial Blendshape Rigs with Physical Simulation
Yeara Kozlov ... Moritz Bächer
Computer Graphics Forum | VOL. 36
Yeara Kozlov, et. al.Yeara Kozlov ... Moritz Bächer
01 May 2017
Computer Graphics Forum | VOL. 36

Prosody off the top of the head: Prosodic contrasts can be discriminated by head motion
Erin Cvejic ... Chris Davis
Speech Communication | VOL. 52
Erin Cvejic, et. al.Erin Cvejic ... Chris Davis
13 Feb 2010
Speech Communication | VOL. 52

Simultaneous tracking of rigid head motion and non-rigid facial animation by analyzing local features statistically
Y Chen ... F Davoine
-
Y Chen, et. al.Y Chen ... F Davoine
01 Jan 2006
01 Jan 2006

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning Expressive Human-Like Head Motion Sequences from Speech

Abstract

Talk to us

Similar Papers