Abstract

Intelligent planning algorithms such as the Partially Observable Markov Decision Process (POMDP) have succeeded in dialog management applications [10, 11, 12] because they are robust to the inherent uncertainty of human interaction. Like all dialog planning systems, however, POMDPs require an accurate model of the user (e.g., what the user might say or want). POMDPs are generally specified using a large probabilistic model with many parameters. These parameters are difficult to specify from domain knowledge alone, and gathering enough data to estimate them accurately a priori is expensive. In this paper, we take a Bayesian approach to learning the user model simultaneously with the dialog manager's policy. At the heart of our approach is an efficient incremental update algorithm that allows the dialog manager to replan just long enough to improve the current dialog policy given data from recent interactions. The update process has a relatively small computational cost, preventing long delays in the interaction. We demonstrate a robust dialog manager that learns from interaction data, outperforming a hand-coded model both in simulation and in a robotic wheelchair application.
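For intuition, the sketch below illustrates one way such a Bayesian user model could be maintained: Dirichlet pseudo-counts over the POMDP's observation probabilities, folded in incrementally after each dialog turn. This is a minimal sketch under assumed names and structure (the class, array shapes, and update scheme are illustrative), not the paper's actual algorithm.

```python
import numpy as np

# Hypothetical illustration: Bayesian (Dirichlet) updating of a POMDP
# observation model from dialog data. All names here are assumptions
# for exposition, not the authors' implementation.

class DirichletUserModel:
    """Maintains Dirichlet counts over p(observation | user state, action)."""

    def __init__(self, n_states, n_actions, n_obs, prior=1.0):
        # Symmetric Dirichlet prior: `prior` acts as a pseudo-count.
        self.counts = np.full((n_states, n_actions, n_obs), prior)

    def update(self, state, action, obs):
        # Incremental update: one interaction adds one count, so the
        # model changes only slightly and replanning can be brief.
        self.counts[state, action, obs] += 1.0

    def obs_probs(self):
        # Posterior-mean estimate of the observation model.
        return self.counts / self.counts.sum(axis=2, keepdims=True)


# Usage: after each dialog turn, fold the new evidence into the model,
# then replan just long enough to improve the current policy.
model = DirichletUserModel(n_states=3, n_actions=2, n_obs=4)
model.update(state=1, action=0, obs=2)
print(model.obs_probs()[1, 0])  # updated beliefs about user responses
```

Because each update only increments a count, the per-turn cost is constant, which matches the abstract's point that learning need not introduce long delays in the interaction.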
