Abstract

Human-computer dialogue has recently attracted extensive attention from both academia and industry as an important branch in the field of artificial intelligence (AI). However, there are few studies on the evaluation of large-scale Chinese human-computer dialogue systems. In this paper, we introduce the Second Evaluation of Chinese Human-Computer Dialogue Technology, which focuses on the identification of a user's intents and the intelligent processing of intent words. The Evaluation consists of user intent classification (Task 1) and online testing of task-oriented dialogues (Task 2), the data sets of which are provided by iFLYTEK Corporation. The evaluation tasks and data sets are described in detail, and the evaluation results and the remaining problems in the evaluation are discussed.

Highlights

  • With the development of artificial intelligence, human-computer dialogue technology has become increasingly popular and has attracted growing attention [1]

  • The Evaluation consists of user intent classification (Task 1) and online testing of task-oriented dialogues (Task 2), the data sets of which are provided by iFLYTEK Corporation

  • In order to avoid the imbalance of category distribution and take into account each category, we evaluate submitted systems based on the F1-measure obtained from precision and recall


Summary

INTRODUCTION

With the development of artificial intelligence, human-computer dialogue technology has become increasingly popular and has attracted growing attention [1]. Task 1 of the Evaluation, held at the 17th China National Conference on Computational Linguistics (CCL2018), is a user intent classification task in the customer service field based on Chinese corpora. The organizers provide open data so that participants can build their systems, which are then tested on hidden data sets. In DSTC6, participants needed to build a system that responds to a user's utterances based on the context of the conversation, and they could use external data; both objective and subjective indicators were used to evaluate the submitted systems [11]. In Task 2, the submitted systems should complete tasks such as ticket inquiry or reservation through online real-time dialogues with human testers. The Evaluation therefore combines automatic evaluation (for the user intent classification task) with online manual testing (for the online testing of task-oriented dialogues).

Task 1
Task 2
EVALUATION OF DATA SETS
Analysis
CONCLUSION

