User simulations for context-sensitive speech recognition in spoken dialogue systems

Oliver Lemon, Ioannis Konstas

doi:10.3115/1609067.1609123

Abstract

We use a machine learner trained on a combination of acoustic and contextual features to predict the accuracy of incoming n-best automatic speech recognition (ASR) hypotheses to a spoken dialogue system (SDS). Our novel approach is to use a simple statistical User Simulation (US) for this task, which measures the likelihood that the user would say each hypothesis in the current context. Such US models are now common in machine learning approaches to SDS, are trained on real dialogue data, and are related to theories of alignment in psycholinguistics. We use a US to predict the user's next dialogue move and thereby re-rank n-best hypotheses of a speech recognizer for a corpus of 2564 user utterances. The method achieved a significant relative reduction of Word Error Rate (WER) of 5% (this is 44% of the possible WER improvement on this data), and 62% of the possible semantic improvement (Dialogue Move Accuracy), compared to the baseline policy of selecting the topmost ASR hypothesis. The majority of the improvement is attributable to the User Simulation feature, as shown by Information Gain analysis.
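The re-ranking idea in the abstract can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and is not taken from the paper: the bigram model over dialogue moves, the add-one smoothing, the move labels, the toy training pairs, and the fixed `weight`. The paper trains a machine learner on acoustic and contextual features; this sketch swaps in a simple fixed-weight log-linear combination of ASR confidence and user-simulation probability to show how a context-dependent likelihood can change which hypothesis is chosen.

```python
import math
from collections import Counter, defaultdict


class BigramUserSimulation:
    """Hypothetical user simulation: estimates P(user_move | system_move)
    from (system_move, user_move) pairs, with add-one smoothing."""

    def __init__(self, move_vocabulary):
        self.moves = list(move_vocabulary)
        self.counts = defaultdict(Counter)

    def train(self, pairs):
        # Count how often each user move follows each system move.
        for system_move, user_move in pairs:
            self.counts[system_move][user_move] += 1

    def prob(self, user_move, system_move):
        context = self.counts[system_move]
        total = sum(context.values())
        return (context[user_move] + 1) / (total + len(self.moves))


def rerank(nbest, system_move, us, weight=0.5):
    """Sort hypotheses (text, asr_confidence, dialogue_move), best first.

    The score is a weighted sum of log ASR confidence and log
    user-simulation probability; `weight` is an illustrative assumption.
    """
    def score(hypothesis):
        _text, asr_confidence, move = hypothesis
        return ((1 - weight) * math.log(asr_confidence)
                + weight * math.log(us.prob(move, system_move)))

    return sorted(nbest, key=score, reverse=True)


# Toy data (invented for this example, not from the paper's corpus).
us = BigramUserSimulation(["provide_genre", "provide_artist", "yes", "no"])
us.train([
    ("ask_genre", "provide_genre"),
    ("ask_genre", "provide_genre"),
    ("ask_genre", "provide_genre"),
    ("ask_genre", "provide_artist"),
])

# The recognizer's top hypothesis is first; the second fits the context better.
nbest = [
    ("yes please", 0.55, "yes"),
    ("jazz please", 0.45, "provide_genre"),
]
ranked = rerank(nbest, "ask_genre", us)
print(ranked[0][0])
```

In this toy case the baseline policy of taking the topmost ASR hypothesis would select "yes please", while the context-sensitive score prefers "jazz please" because the simulated user is much more likely to provide a genre after being asked for one.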
