Abstract

Emotive audio-visual avatars are virtual computer agents with the potential to significantly improve the quality of human-machine interaction and human-human communication. However, the understanding of human communication has not yet advanced to the point where it is possible to build realistic avatars that interact with natural-sounding emotive speech and realistic-looking emotional facial expressions. In this paper, we propose the technical approaches of a novel multimodal framework leading to a text-driven emotive audio-visual avatar. Our primary work focuses on emotive speech synthesis, realistic emotional facial expression animation, and the co-articulation between speech gestures (i.e., lip movements) and facial expressions. A general framework for emotive text-to-speech (TTS) synthesis using a diphone synthesizer is designed and integrated into a generic 3-D avatar face model, and under the guidance of this framework we developed a realistic 3-D avatar prototype. A rule-based emotive TTS synthesis module based on the Festival-MBROLA architecture has been implemented to demonstrate the effectiveness of the framework design. Subjective listening experiments were carried out to evaluate the expressiveness of the synthetic talking avatar.
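
To make the rule-based Festival-MBROLA pipeline concrete, the following is a minimal sketch of how an emotive prosody layer over MBROLA diphone synthesis might look. The emotion rules (pitch and duration scale factors), the sample phoneme sequence, and the `us1` voice path are illustrative assumptions, not values taken from the paper; only the MBROLA .pho file format and command-line invocation are standard.

```python
# Sketch of a rule-based emotive prosody layer over MBROLA diphone synthesis.
# Emotion rules and phoneme values below are illustrative assumptions.
import subprocess

# Hypothetical prosody rules: (pitch scale, duration scale) per emotion.
EMOTION_RULES = {
    "neutral": (1.00, 1.00),
    "happy":   (1.15, 0.90),  # higher pitch, faster speech rate
    "sad":     (0.90, 1.20),  # lower pitch, slower speech rate
}

def apply_emotion(phones, emotion):
    """Scale durations and pitch targets of (phoneme, dur_ms, [(pos%, hz), ...]) tuples."""
    pitch_k, dur_k = EMOTION_RULES[emotion]
    return [
        (ph, round(dur * dur_k), [(pos, round(hz * pitch_k)) for pos, hz in targets])
        for ph, dur, targets in phones
    ]

def to_pho(phones):
    """Serialize to the MBROLA .pho format: 'phoneme dur [pos hz]*' per line."""
    lines = []
    for ph, dur, targets in phones:
        fields = [ph, str(dur)] + [f"{pos} {hz}" for pos, hz in targets]
        lines.append(" ".join(fields))
    return "\n".join(lines) + "\n"

# Example: SAMPA phonemes for "hello" with baseline durations (ms) and pitch
# targets (% of phoneme, Hz); illustrative values, not real Festival output.
phones = [
    ("_",  100, []),
    ("h",   60, [(50, 120)]),
    ("@",   80, [(50, 125)]),
    ("l",   70, [(50, 130)]),
    ("oU", 180, [(20, 135), (80, 110)]),
    ("_",  100, []),
]

with open("hello.pho", "w") as f:
    f.write(to_pho(apply_emotion(phones, "happy")))

# Render with an MBROLA voice database (assumes mbrola and the us1 voice
# are installed locally; the path is an assumption for this example).
subprocess.run(["mbrola", "us1/us1", "hello.pho", "hello.wav"], check=True)
```

In a full pipeline of the kind the paper describes, Festival would supply the phoneme sequence, baseline durations, and pitch contour from input text, and the emotion rules would be applied before handing the .pho stream to MBROLA for waveform generation.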
