Speech-Driven Animation Constrained by Appropriate Discourse Functions

Najmeh Sadoughi, Yang Liu, Carlos Busso

doi:10.1145/2663204.2663252

Abstract

Conversational agents provide powerful opportunities to interact and engage with users. The challenge is how to create naturalistic behaviors that replicate the complex gestures observed during human interactions. Previous studies have used rule-based frameworks or data-driven models to generate appropriate gestures that are properly synchronized with the underlying discourse functions. Among these methods, speech-driven approaches are especially appealing given the rich information conveyed in speech, which captures emotional cues and prosodic patterns that are important for synthesizing behaviors (i.e., modeling the variability and complexity of the timings of the behaviors). The main limitation of these models is that they fail to capture the underlying semantic and discourse functions of the message (e.g., nodding). This study proposes a speech-driven framework that explicitly models discourse functions, bridging the gap between speech-driven and rule-based models. The approach is based on a dynamic Bayesian network (DBN), where an additional node is introduced to constrain the models by specific discourse functions. We implement the approach by synthesizing head and eyebrow motion. We conduct perceptual evaluations to compare the animations generated using the constrained and unconstrained models.
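
The idea of adding a constraint node to a DBN can be illustrated with a toy example. The sketch below is not the authors' model: it assumes two discrete hidden states, a binary discretized prosody symbol, and a hypothetical constraint node with the values "none" and "affirmation". All probabilities and names (`PRIOR`, `TRANS`, `EMIT_SPEECH`, `MOTION_MEAN`, `forward_filter`, `expected_motion`) are invented for illustration. It only shows how conditioning the hidden-state prior and transitions on a discourse function changes the motion inferred from the same speech input.

```python
# Toy sketch of a discrete DBN whose hidden state is conditioned on a
# discourse-function (constraint) node. All numbers are invented; this is
# an illustration under stated assumptions, not the model from the paper.

def normalize(values):
    """Scale a list of non-negative numbers so that it sums to 1."""
    total = sum(values)
    return [v / total for v in values]

# P(h_0 | c): prior over hidden states (0 = "still", 1 = "nod-like").
PRIOR = {"none": [0.8, 0.2], "affirmation": [0.3, 0.7]}

# P(h_t | h_{t-1}, c): one transition matrix per constraint value.
TRANS = {
    "none": [[0.9, 0.1], [0.5, 0.5]],
    "affirmation": [[0.4, 0.6], [0.2, 0.8]],
}

# P(speech_t | h_t): discretized prosody symbol (0 = low energy, 1 = high).
EMIT_SPEECH = [[0.7, 0.3], [0.4, 0.6]]

# Mean head-pitch velocity associated with each hidden state (arbitrary units).
MOTION_MEAN = [0.0, 1.0]


def forward_filter(speech, constraint):
    """Return P(h_t | speech_1..t, constraint) for every frame t."""
    trans = TRANS[constraint]
    belief = normalize(
        [PRIOR[constraint][h] * EMIT_SPEECH[h][speech[0]] for h in range(2)]
    )
    beliefs = [belief]
    for obs in speech[1:]:
        # Predict step: propagate the belief through the constrained transitions.
        predicted = [
            sum(belief[i] * trans[i][j] for i in range(2)) for j in range(2)
        ]
        # Update step: weight by the likelihood of the observed speech symbol.
        belief = normalize([predicted[j] * EMIT_SPEECH[j][obs] for j in range(2)])
        beliefs.append(belief)
    return beliefs


def expected_motion(speech, constraint):
    """Expected head motion per frame under the filtered hidden-state belief."""
    return [
        sum(b[h] * MOTION_MEAN[h] for h in range(2))
        for b in forward_filter(speech, constraint)
    ]


if __name__ == "__main__":
    speech = [0, 1, 1, 0]  # hypothetical prosody symbols for four frames
    print("unconstrained:", expected_motion(speech, "none"))
    print("constrained:  ", expected_motion(speech, "affirmation"))
```

With the same speech symbols, the "affirmation" setting shifts probability mass toward the nod-like state in every frame, which is the qualitative effect a constraint node is meant to have. A real system would use continuous prosodic and motion features and learn the parameters from motion capture data.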
