Abstract

We present CRUISE, a system that guides ordinary software developers in building a high-quality natural language understanding (NLU) engine from scratch. This is the fundamental step in building a new skill for a personal assistant. Unlike existing solutions that require developers or crowd workers to manually generate and annotate a large number of utterances, we design a hybrid rule-based and data-driven approach that iteratively generates more and more utterances. Our system requires only a light human workload to iteratively prune incorrect utterances. CRUISE outputs a well-trained NLU engine and a large-scale annotated utterance corpus that third parties can use to develop their custom skills. Using both a benchmark dataset and custom datasets collected in real-world settings, we validate the high quality of CRUISE-generated utterances through both competitive NLU performance and human evaluation. We also show a large reduction in human workload, in terms of both cognitive load and pruning time.

Highlights

  • Intelligent voice-enabled personal assistants, such as Alexa, Google Assistant, Siri, and Bixby, have been emerging in our daily life

  • The key to developing a new skill is understanding all varieties of user utterances and carrying out the user's intent, a task handled by the natural language understanding (NLU) engine

  • Since users interact with personal assistant agents through spoken language, most industrial products focus on spoken language understanding (SLU), in which it is sufficient to understand a user query by classifying the intent and identifying a set of slots (Liu and Lane, 2016)
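The intent/slot formulation above can be made concrete with a small sketch. The utterance, intent name, and slot labels below are hypothetical illustrations (not from the paper); slots are marked with the common BIO tagging scheme, and a small helper collapses the tags back into (slot, value) pairs.

```python
# Hypothetical annotated utterance illustrating the two SLU sub-tasks:
# intent classification (one label per utterance) and slot filling
# (one BIO tag per token).
annotation = {
    "intent": "PlayMusic",
    "slots": [
        ("play", "O"),
        ("some", "O"),
        ("jazz", "B-genre"),
        ("by", "O"),
        ("miles", "B-artist"),
        ("davis", "I-artist"),
    ],
}

def extract_slots(tagged_tokens):
    """Collapse BIO-tagged tokens into (slot_name, value) pairs."""
    slots, name, tokens = [], None, []
    for token, tag in tagged_tokens:
        if tag.startswith("B-"):
            if name:
                slots.append((name, " ".join(tokens)))
            name, tokens = tag[2:], [token]
        elif tag.startswith("I-") and name == tag[2:]:
            tokens.append(token)
        else:
            if name:
                slots.append((name, " ".join(tokens)))
            name, tokens = None, []
    if name:
        slots.append((name, " ".join(tokens)))
    return slots

print(annotation["intent"])                # PlayMusic
print(extract_slots(annotation["slots"]))  # [('genre', 'jazz'), ('artist', 'miles davis')]
```

An NLU engine trained on such annotations predicts both the intent label and the per-token slot tags for unseen utterances.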


Summary

Introduction

Intelligent voice-enabled personal assistants, such as Alexa, Google Assistant, Siri, and Bixby, have been emerging in our daily life. Prior research has bootstrapped utterance generation: these approaches first generate canonical utterances based on either a lexicon/grammar (Wang et al., 2015) or language/SQL templates (Iyer et al., 2017), then use crowdsourcing to create paraphrases and correct labels. They require software developers to have natural language expertise and still rely heavily on costly crowdsourcing. In contrast, CRUISE does not depend on crowdsourcing to carry out the heavy task of manually generating utterances and annotating slots.

(Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, System Demonstrations, pages 105–110, Melbourne, Australia, July 15–20, 2018. © 2018 Association for Computational Linguistics.)
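The template-based canonical utterance generation mentioned above can be sketched minimally as a cross-product expansion of slot lexicons into a template. The template and lexicon entries below are hypothetical illustrations, not CRUISE's actual grammar.

```python
import itertools

# Hypothetical template and lexicon for a restaurant-search skill,
# illustrating grammar/template-based canonical utterance generation.
template = "{verb} a {cuisine} restaurant {area}"
lexicon = {
    "verb": ["find", "book"],
    "cuisine": ["thai", "italian"],
    "area": ["near me", "downtown"],
}

def expand(template, lexicon):
    """Yield every utterance obtained by filling the template's
    placeholders with all combinations of lexicon entries."""
    keys = [k for k in lexicon if "{" + k + "}" in template]
    for combo in itertools.product(*(lexicon[k] for k in keys)):
        yield template.format(**dict(zip(keys, combo)))

utterances = list(expand(template, lexicon))
# 2 * 2 * 2 = 8 utterances, e.g. "find a thai restaurant near me"
```

Because each placeholder's filler is known at generation time, every utterance comes out with its slot annotations for free; the downside, which motivates adding a data-driven component, is that pure template expansion covers only the phrasings the grammar author anticipated.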

Background and Related Work
Our Settings
CRUISE Design
Rule-based Iteration Design
Data-driven Tagged Sentence Filler
Experimental Evaluation
Objective Evaluation via NLU Engines
Subjective Human Evaluation
Findings
Human Workload Analysis

