GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-supervised Learning and Explicit Policy Injection

Wanwei He, Jian Sun, Dermot Liu, Zheng Cao, Peng Jiang, Fei Huang, Luo Si, Yongbin Li, Yinpei Dai, Yinhe Zheng, Min Yang, Yuchuan Wu

doi:10.1609/aaai.v36i10.21320

Abstract

Pre-trained models have proved to be powerful in enhancing task-oriented dialog systems. However, current pre-training methods mainly focus on enhancing dialog understanding and generation tasks while neglecting the exploitation of dialog policy. In this paper, we propose GALAXY, a novel pre-trained dialog model that explicitly learns dialog policy from limited labeled dialogs and large-scale unlabeled dialog corpora via semi-supervised learning. Specifically, we introduce a dialog act prediction task for policy optimization during pre-training and employ a consistency regularization term to refine the learned representation with the help of unlabeled dialogs. We also implement a gating mechanism to weigh suitable unlabeled dialog samples. Empirical results show that GALAXY substantially improves the performance of task-oriented dialog systems, and achieves new state-of-the-art results on benchmark datasets: In-Car, MultiWOZ2.0 and MultiWOZ2.1, improving their end-to-end combined scores by 2.5, 5.3 and 5.5 points, respectively. We also show that GALAXY has a stronger few-shot ability than existing models under various low-resource settings. For reproducibility, we release the code and data at https://github.com/siat-nlp/GALAXY.
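
The abstract names a consistency regularization term and a gating mechanism for unlabeled dialogs but does not give their exact form. The following is a minimal sketch of how such a pair could fit together, not the authors' implementation. It assumes (hypothetically) that the model yields two sets of dialog act logits for the same unlabeled input, for example from two stochastic forward passes, that consistency is measured with a symmetric KL divergence over a softmax, and that the gate is a hard entropy threshold. The function names and the `threshold` parameter are illustrative only.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def consistency_loss(logits_a, logits_b):
    """Symmetric KL between two predictions for the same input (assumed form)."""
    p, q = softmax(logits_a), softmax(logits_b)
    return 0.5 * (kl_divergence(p, q) + kl_divergence(q, p))


def entropy_gate(logits, threshold):
    """Hypothetical hard gate: 1.0 if the prediction entropy is below threshold."""
    p = softmax(logits)
    entropy = -sum(pi * math.log(pi + 1e-12) for pi in p)
    return 1.0 if entropy < threshold else 0.0


def gated_consistency(batch, threshold=0.5):
    """Average gated consistency loss over a batch of (logits_a, logits_b) pairs."""
    total = 0.0
    for logits_a, logits_b in batch:
        total += entropy_gate(logits_a, threshold) * consistency_loss(logits_a, logits_b)
    return total / len(batch)
```

Under these assumptions, an unlabeled sample whose predicted act distribution is close to uniform receives a gate of zero and contributes nothing to the loss, while a confidently predicted sample is penalized when its two predictions disagree.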

Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Publication Date: Jun 28, 2022
Citations: 30

Similar Papers

Self-training Improves Pre-training for Few-shot Learning in Task-oriented Dialog Systems
Fei Mi ... Lingjing Kong
01 Jan 2020

Self-training Improves Pre-training for Few-shot Learning in Task-oriented Dialog Systems
21 Oct 2021

Winnie: Task-Oriented Dialog System with Structure-Aware Contrastive Learning and Enhanced Policy Planning
Kaizhi Gao ... Suli Zou
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 38
24 Mar 2024

Semi-supervised transfer learning with hierarchical self-regularization
Xingjian Li ... Chengzhong Xu
Pattern Recognition | VOL. 144
26 Jul 2023
