Abstract

Existing end-to-end task-oriented dialog systems struggle to dynamically model long dialog context for interactions and to effectively incorporate knowledge base (KB) information into dialog generation. To overcome these limitations, we propose a Dual Dynamic Memory Network (DDMN) for multi-turn dialog generation, which maintains two core components: a dialog memory manager and a KB memory manager. The dialog memory manager dynamically expands the dialog memory turn by turn and keeps track of dialog history with an updating mechanism, which encourages the model to filter out irrelevant dialog history and memorize important newly arriving information. The KB memory manager shares the structural KB triples throughout the whole conversation, and dynamically extracts KB information with a memory pointer at each turn. Experimental results on three benchmark datasets demonstrate that DDMN significantly outperforms strong baselines in terms of both automatic and human evaluation. Our code is available at https://github.com/siat-nlp/DDMN.
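To make the dual-memory idea concrete, the sketch below shows one plausible way the two managers could be wired up: a dialog memory that is gated and then expanded with the latest turn, and a KB memory queried with a soft pointer at each decoding step. This is a minimal illustration written for this summary, not the authors' implementation (see the repository linked above); all class names, shapes, and the specific gating scheme are assumptions.

```python
import torch
import torch.nn as nn


class DialogMemoryManager(nn.Module):
    """Sketch: turn-level memory that is filtered by a learned gate and
    then grows by one slot per turn (hypothetical simplification of the
    paper's updating mechanism)."""

    def __init__(self, hidden_size):
        super().__init__()
        self.update_gate = nn.Linear(2 * hidden_size, hidden_size)

    def forward(self, memory, turn_state):
        # memory: (batch, num_slots, hidden) accumulated from earlier turns
        # turn_state: (batch, hidden) summary of the current utterance
        expanded = turn_state.unsqueeze(1).expand_as(memory)
        gate = torch.sigmoid(
            self.update_gate(torch.cat([memory, expanded], dim=-1)))
        # keep relevant history, overwrite the rest with the new information
        updated = gate * memory + (1 - gate) * expanded
        # append the current turn as a new memory slot
        return torch.cat([updated, turn_state.unsqueeze(1)], dim=1)


class KBMemoryManager(nn.Module):
    """Sketch: attention over KB triple embeddings, producing a soft
    'memory pointer' distribution for each decoder state."""

    def __init__(self, hidden_size):
        super().__init__()
        self.query_proj = nn.Linear(hidden_size, hidden_size)

    def forward(self, kb_memory, decoder_state):
        # kb_memory: (batch, num_triples, hidden), shared across all turns
        # decoder_state: (batch, hidden)
        query = self.query_proj(decoder_state).unsqueeze(2)  # (batch, hidden, 1)
        scores = torch.bmm(kb_memory, query).squeeze(2)       # (batch, num_triples)
        pointer = torch.softmax(scores, dim=-1)                # pointer distribution
        context = torch.bmm(pointer.unsqueeze(1), kb_memory).squeeze(1)
        return context, pointer
```

In this reading, the dialog memory is rebuilt incrementally across turns while the KB memory stays fixed and is only re-queried, which is one way to keep dialog history and KB knowledge in separate stores rather than a single flat memory.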

Highlights

  • Task-oriented dialog systems are designed to help users achieve specific goals with natural language, such as weather inquiry or restaurant reservation

  • By analyzing the responses generated by BossNet, we find that BossNet tends to copy the necessary entity words from the knowledge base (KB), but many of these entity words appear out of order compared with the gold response

  • MLM achieves a much higher BLEU score than previous models, which is due to its separate memories for modeling dialog context and KB results


Summary

Introduction

Task-oriented dialog systems are designed to help users achieve specific goals with natural language, such as weather inquiry or restaurant reservation. Despite the remarkable progress of previous studies, current memory-based models for multi-turn task-oriented dialog systems still suffer from the following limitations. Existing methods concatenate the dialog utterances of the current turn and previous turns as a whole, which ignores the reasoning process performed by the model in earlier turns and cannot dynamically track long-term dialog states. These methods also introduce considerable noise, since the previous utterances used as context are lengthy and redundant (Zhang et al., 2018). In addition, previous studies tend to confound dialog history with KB knowledge, storing both in a flat memory.


