Using Context Information for Dialog Act Classification in DNN Framework

Yang Liu,Zhao Tan,Kun Han,Yun Lei

doi:10.18653/v1/d17-1231

Abstract

Previous work on dialog act (DA) classification has investigated different methods, such as hidden Markov models, maximum entropy, conditional random fields, graphical models, and support vector machines. A few recent studies explored using deep learning neural networks for DA classification, however, it is not clear yet what is the best method for using dialog context or DA sequential information, and how much gain it brings. This paper proposes several ways of using context information for DA classification, all in the deep learning framework. The baseline system classifies each utterance using the convolutional neural networks (CNN). Our proposed methods include using hierarchical models (recurrent neural networks (RNN) or CNN) for DA sequence tagging where the bottom layer takes the sentence CNN representation as input, concatenating predictions from the previous utterances with the CNN vector for classification, and performing sequence decoding based on the predictions from the sentence CNN model. We conduct thorough experiments and comparisons on the Switchboard corpus, demonstrate that incorporating context information significantly improves DA classification, and show that we achieve new state-of-the-art performance for this task.

Highlights

Dialog act (DA) represents a function of a speaker’s utterance in either human-to-human or human-to-computer conversations
If the previous sentence is a question, there is a high probability that the current sentence is a response to that question. Such context information has been explored in some previous methods, for example, hidden Markov models (HMM), conditional random fields (CRF), dynamic Bayesian networks (DBN)
Given the recent success of the deep learning framework in various language processing tasks, in this work we employ neural networks for DA classification. Such models have been used in some recent studies for DA classification, e.g., (RojasBarahona et al, 2016; Kalchbrenner and Blunsom, 2013; Zhou et al, 2015); previous work has not thoroughly evaluated the use of context information for this task, and there is still a lack of good understanding about how we can use context information and how useful it is

Summary

Introduction

Dialog act (DA) represents a function of a speaker’s utterance in either human-to-human or human-to-computer conversations. Given the recent success of the deep learning framework in various language processing tasks, in this work we employ neural networks for DA classification Such models have been used in some recent studies for DA classification, e.g., (RojasBarahona et al, 2016; Kalchbrenner and Blunsom, 2013; Zhou et al, 2015); previous work has not thoroughly evaluated the use of context information for this task, and there is still a lack of good understanding about how we can use context information and how useful it is. This is the question we aim to answer in this work

Objectives

Methods

Results

Conclusion