Recognizing Uncertainty in Speech

Heather Pon-Barry,Stuart M Shieber

doi:10.1155/2011/251753

Heather Pon-Barry, Stuart M Shieber

Open Access

https://doi.org/10.1155/2011/251753

Copy DOI

Abstract

We address the problem of inferring a speaker's level of certainty based on prosodic information in the speech signal, which has application in speech-based dialogue systems. We show that using phrase-level prosodic features centered around the phrases causing uncertainty, in addition to utterance-level prosodic features, improves our model's level of certainty classification. In addition, our models can be used to predict which phrase a person is uncertain about. These results rely on a novel method for eliciting utterances of varying levels of certainty that allows us to compare the utility of contextually-based feature sets. We elicit level of certainty ratings from both the speakers themselves and a panel of listeners, finding that there is often a mismatch between speakers' internal states and their perceived states, and highlighting the importance of this distinction.

Highlights

Speech-based technology has become a familiar part of our everyday lives
If we enable computers to do the same, we can improve how applications such as spoken tutorial dialogue systems [2], language learning systems [3], and voice search applications [4] interact with users
We find that our basic prosody model has lower RMS error than the nonprosodic baseline model: 0.738 compared to 1.059

Summary

Introduction

While most people can think of an instance where they have interacted with a call-center dialogue system, or command-based smartphone application, few would argue that the experience was as natural or as efficient as conversing with another human. To build computer systems that can communicate with humans using natural language, we need to know more than just the words a person is saying; we need to have an understanding of his or her internal mental state. Level of certainty is an important component of internal state. If we enable computers to do the same, we can improve how applications such as spoken tutorial dialogue systems [2], language learning systems [3], and voice search applications [4] interact with users

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: EURASIP Journal on Advances in Signal Processing	Publication Date: Dec 8, 2010
Citations: 11	License type: cc-by

R Discovery Prime

R Discovery Prime

Recognizing Uncertainty in Speech

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: EURASIP Journal on Advances in Signal Processing

Lead the way for us

Similar Papers

The importance of sub-utterance prosody in predicting level of certainty
Heather Pon-Barry ... Stuart Shieber
-
Heather Pon-Barry, et. al.Heather Pon-Barry ... Stuart Shieber
01 Jan 2009
01 Jan 2009

American dialect identification using phonotactic and prosodic features
A Etman ... A A Louis
-
A Etman, et. al.A Etman ... A A Louis
01 Nov 2015
01 Nov 2015

Incorporating lexical and prosodic information at different levels for meeting summarization
Catherine Lai ... Steve Renals
-
Catherine Lai, et. al.Catherine Lai ... Steve Renals
14 Sep 2014
14 Sep 2014

How prosodic cues could lead to information center in speech - An alternative to ASR
Chao-Yu Su ... Chiu-Yu Tseng
-
Chao-Yu Su, et. al.Chao-Yu Su ... Chiu-Yu Tseng
01 Nov 2017
01 Nov 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Recognizing Uncertainty in Speech

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: EURASIP Journal on Advances in Signal Processing