Abstract

Artificial intelligence-based dialog systems are receiving attention from both business and academic communities. The key components of such intelligent chatbot systems are domain classification, intent detection, and named entity recognition, and various supervised, unsupervised, and hybrid approaches are used for each task. Such intelligent systems, also called natural language understanding (NLU) systems, analyze user requests sequentially: domain classification first, followed by intent detection and entity recognition based on the semantic rules of the classified domain. This sequential approach propagates errors downstream; i.e., if the domain classification model misclassifies the domain, intent detection and entity recognition also fail. Furthermore, training such an intelligent system requires a large number of user-annotated examples for each domain. This study proposes a single joint predictive deep neural network framework based on long short-term memory (LSTM) that uses only a small user-annotated dataset to address these issues. It investigates the value added by incorporating unlabeled data from user chat logs into multi-domain spoken language understanding systems. Systematic experimental analysis of the proposed joint frameworks, together with the semi-supervised multi-domain model, on open-source annotated and unannotated utterances shows a robust improvement in the predictive performance of the proposed multi-domain intelligent chatbot over a base joint model and a joint model based on adversarial learning.

Highlights

  • Natural language understanding (NLU) and speech understanding (SU) play a significant role in human-computer interaction (HCI) applications

  • This study reduces the human effort required for manual annotation of utterances by incorporating unannotated datasets from various data sources, such as user query logs, into a deep neural network (DNN), i.e., a single jointly trained long short-term memory (LSTM)-based NLU model of a multi-domain intelligent chatbot

  • We propose a semi-supervised joint model, SEMI-MDJM, for an intelligent chatbot system that extracts the domain, intent, and entities of user queries with a single LSTM-based machine learning (ML) model, mitigating the propagation of downstream errors (see the architecture sketch after this list)
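
As an illustration of the single-model idea named in the last highlight, the sketch below shows a shared Bi-LSTM encoder feeding three output heads: domain and intent at the utterance level and slots at the token level. This is a minimal sketch, not the paper's implementation; the layer sizes, label counts, and the class name JointNLUModel are assumptions, and PyTorch is used purely for illustration.

```python
# Minimal sketch of a joint NLU model: one shared Bi-LSTM encoder with three
# heads (domain, intent, per-token slots). Sizes and names are hypothetical.
import torch
import torch.nn as nn

class JointNLUModel(nn.Module):
    def __init__(self, vocab_size, emb_dim=100, hidden=128,
                 n_domains=3, n_intents=10, n_slots=20):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim, padding_idx=0)
        self.bilstm = nn.LSTM(emb_dim, hidden, batch_first=True,
                              bidirectional=True)
        self.domain_head = nn.Linear(2 * hidden, n_domains)  # utterance-level
        self.intent_head = nn.Linear(2 * hidden, n_intents)  # utterance-level
        self.slot_head = nn.Linear(2 * hidden, n_slots)      # token-level

    def forward(self, token_ids):
        x = self.embed(token_ids)              # (batch, seq, emb)
        states, _ = self.bilstm(x)             # (batch, seq, 2*hidden)
        pooled = states.mean(dim=1)            # utterance representation
        return (self.domain_head(pooled),
                self.intent_head(pooled),
                self.slot_head(states))        # slot logits per token
```

Because all three heads share the same encoder, a single forward pass yields domain, intent, and slot predictions, which is what avoids the error propagation of a pipeline of separate models.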


Summary

Introduction

Natural language understanding (NLU) and speech understanding (SU) play a significant role in human-computer interaction (HCI) applications. Organizations are struggling to manage the growth of user query data, and they have been deploying intelligent chatbots to serve customers 24/7, with or without call center support, to address this issue. Such intelligent systems have three key components: domain classification, intent detection, and entity recognition. This study reduces the human effort required for manual annotation of utterances by incorporating unannotated datasets from various data sources, such as user query logs, into a DNN algorithm, i.e., a single jointly trained long short-term memory (LSTM)-based NLU model of a multi-domain intelligent chatbot. A single semi-supervised multi-domain joint model (SEMI-MDJM) based on LSTM outperforms a joint base model and an adversarial multi-domain joint model on each task, i.e., domain classification, intent prediction, and entity recognition.
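
This summary does not spell out how the unannotated query logs are folded into training. The sketch below shows one common semi-supervised recipe, pseudo-labeling with a confidence threshold, applied to a joint model like the one sketched above; the paper's actual scheme may differ, and the 0.9 threshold, the tensor shapes, and the helper names are assumptions for illustration only.

```python
# Hedged sketch: pseudo-label unannotated utterances with a joint NLU model,
# keeping only high-confidence predictions, then train with a summed joint loss.
import torch
import torch.nn.functional as F

@torch.no_grad()
def pseudo_label(model, unlabeled_ids, threshold=0.9):
    """Keep utterances whose predicted domain and intent are high-confidence."""
    model.eval()
    domain_logits, intent_logits, slot_logits = model(unlabeled_ids)
    dom_prob, dom_pred = F.softmax(domain_logits, dim=-1).max(dim=-1)
    int_prob, int_pred = F.softmax(intent_logits, dim=-1).max(dim=-1)
    keep = (dom_prob > threshold) & (int_prob > threshold)
    slot_pred = slot_logits.argmax(dim=-1)      # per-token slot labels
    return unlabeled_ids[keep], dom_pred[keep], int_pred[keep], slot_pred[keep]

def joint_loss(model, token_ids, domain_y, intent_y, slot_y):
    """Sum of the three task losses; weighting is a tunable choice."""
    d, i, s = model(token_ids)
    return (F.cross_entropy(d, domain_y)
            + F.cross_entropy(i, intent_y)
            + F.cross_entropy(s.transpose(1, 2), slot_y))  # (batch, C, seq)
```

The selected pseudo-labeled utterances can then be mixed with the small annotated set and the model retrained with the same joint loss, which is the general mechanism by which unlabeled logs reduce the manual annotation burden.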

Literature Review
Domain Prediction
Intent Detection
Entity Extraction or Slot Filling
Joint Training for Multi-Domain Intelligent Chatbot System
Adversarial Learning
Semi-Supervised Learning for NLU
Embedding and Bi-LSTM Layer
Evaluation Criteria
Optimization
Experiment
Conclusion
