Abstract
RNN encoder-decoder architectures have well-known difficulties generating meaningful responses. Variational autoencoders (VAEs) combined with hierarchical RNNs have emerged as a powerful framework for conversation modeling, as the latent variables can encode high-level information (topics, tones, sentiments, etc.) in conversations. Meanwhile, BERT, one of the latest deep pre-trained language representation models, has achieved remarkable state-of-the-art results across a wide range of natural language processing tasks. However, BERT has not yet been investigated for conversation generation. In this paper, we explore different BERT-empowered conversation modeling approaches that combine BERT, RNNs, and VAEs. BERT can be used either with its weights fixed, as a feature extraction module, or with its weights updated and optimized for a specific task. We demonstrate that simply using fixed pre-trained BERT as part of the model, without further fine-tuning, is powerful enough to generate better responses in terms of fluency, grammar, and semantic coherence; fine-tuning achieves comparable results. This paper sets new baselines for the conversation generation task, and we are the first to demonstrate the success of BERT in conversation modeling.
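For concreteness, the sketch below (not the authors' code) illustrates the two usage modes of BERT described above, assuming the Hugging Face transformers library and PyTorch: a frozen feature extractor that produces utterance vectors for a downstream hierarchical RNN/VAE conversation model, versus a fine-tunable encoder whose weights are updated together with the rest of the model.

```python
# Minimal sketch (illustrative only): pre-trained BERT as a frozen feature
# extractor vs. a fine-tunable encoder. Model names and the FINETUNE flag are
# assumptions for illustration, not taken from the paper.
import torch
from transformers import BertModel, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
bert = BertModel.from_pretrained("bert-base-uncased")

FINETUNE = False  # False: fixed feature-extraction module; True: update BERT weights

if not FINETUNE:
    for p in bert.parameters():
        p.requires_grad = False  # freeze BERT; only the downstream decoder/VAE trains

utterance = "how are you doing today ?"
inputs = tokenizer(utterance, return_tensors="pt")

# With frozen weights, gradient tracking through BERT can be skipped entirely.
with (torch.no_grad() if not FINETUNE else torch.enable_grad()):
    outputs = bert(**inputs)

# [CLS] token representation as a fixed-size utterance encoding that a
# hierarchical RNN / VAE conversation model could consume as utterance-level input.
utterance_vec = outputs.last_hidden_state[:, 0, :]  # shape: (1, 768)
print(utterance_vec.shape)
```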