Abstract

The use of Reinforcement Learning (RL) for dialogue policy optimization has become the dominant trend in dialogue management. Several methods have been proposed that are trained on dialogue data to provide optimal system responses. However, most of these approaches suffer from performance degradation in the presence of noise, poor scalability to other domains, and unstable performance. To overcome these problems, we propose a novel approach based on the incremental, sample-efficient Least-Squares Policy Iteration (LSPI) algorithm, which is trained on compact, fixed-size dialogue state encodings obtained from deep Variational Denoising Autoencoders (VDAE). The proposed scheme exhibits stable, noise-robust performance and significantly outperforms the current state of the art, even in mismatched noise environments.
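To make the core idea concrete, the sketch below shows one LSTD-Q evaluation step, the inner loop of LSPI, operating on fixed-size state vectors such as those a VDAE encoder might produce. This is a minimal illustration, not the paper's implementation; the names `transitions`, `phi`, and `policy` are assumptions introduced here for clarity.

```python
import numpy as np

def lstdq(transitions, phi, policy, gamma=0.99, reg=1e-3):
    """One LSTD-Q evaluation step: solve A w = b for Q-weights w.

    transitions: list of (s, a, r, s_next), with s and s_next already
                 encoded into fixed-size vectors (e.g., by a VDAE encoder).
    phi:         feature map phi(s, a) -> np.ndarray of length k.
    policy:      current greedy policy, a = policy(s_next).
    (All names here are illustrative assumptions, not the paper's API.)
    """
    k = len(phi(*transitions[0][:2]))
    A = reg * np.eye(k)          # regularization keeps A invertible
    b = np.zeros(k)
    for s, a, r, s_next in transitions:
        f = phi(s, a)
        f_next = phi(s_next, policy(s_next))
        A += np.outer(f, f - gamma * f_next)
        b += r * f
    return np.linalg.solve(A, b)  # Q(s, a) is approximated by phi(s, a) @ w
```

Full policy iteration would alternate this evaluation step with a greedy policy improvement step (choosing the action maximizing `phi(s, a) @ w`) until the weight vector converges.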
