Sentence model based subword embeddings for a dialog system

Euisok Chung,Hwa Jeon Song,Hyun Woo Kim

doi:10.4218/etrij.2020-0245

Euisok Chung, Hwa Jeon Song + Show 1 more

Open Access

https://doi.org/10.4218/etrij.2020-0245

Copy DOI

Journal: ETRI Journal	Publication Date: May 12, 2022
Citations: 3	License type: publisher-specific license

Affiliation: Electronics and Telecommunications Research Institute

Abstract

This study focuses on improving a word embedding model to enhance the performance of downstream tasks, such as those of dialog systems. To improve traditional word embedding models, such as skip-gram, it is critical to refine the word features and expand the context model. In this paper, we approach the word model from the perspective of subword embedding and attempt to extend the context model by integrating various sentence models. Our proposed sentence model is a subword-based skip-thought model that integrates self-attention and relative position encoding techniques. We also propose a clustering-based dialog model for downstream task verification and evaluate its relationship with the sentence-model-based subword embedding technique. The proposed subword embedding method produces better results than previous methods in evaluating word and sentence similarity. In addition, the downstream task verification, a clustering-based dialog system, demonstrates an improvement of up to 4.86% over the results of FastText in previous research.

Full Text