How Would You Say It? Eliciting Lexically Diverse Dialogue for Supervised Semantic Parsing

Abhilasha Ravichander,Jonathan Francis,Eric Nyberg,Thomas Manzini,Matthias Grabmair,Graham Neubig

doi:10.18653/v1/w17-5545

Abstract

Building dialogue interfaces for real-world scenarios often entails training semantic parsers starting from zero examples. How can we build datasets that better capture the variety of ways users might phrase their queries, and what queries are actually realistic? Wang et al. (2015) proposed a method to build semantic parsing datasets by generating canonical utterances using a grammar and having crowdworkers paraphrase them into natural wording. A limitation of this approach is that it induces bias towards using similar language as the canonical utterances. In this work, we present a methodology that elicits meaningful and lexically diverse queries from users for semantic parsing tasks. Starting from a seed lexicon and a generative grammar, we pair logical forms with mixed text-image representations and ask crowdworkers to paraphrase and confirm the plausibility of the queries that they generated. We use this method to build a semantic parsing dataset from scratch for a dialog agent in a smart-home simulation. We find evidence that this dataset, which we have named SmartHome, is demonstrably more lexically diverse and difficult to parse than existing domain-specific semantic parsing datasets.

Highlights

Semantic parsing is the task of mapping natural language utterances to their underlying meaning representations
Because the canonical utterances may be ungrammatical or stilted, they are paraphrased by crowd workers to be more natural queries in the target language. We argue that this approach has three limitations when constructing semantic parsers for new domains: (1) the seed utterances may induce bias towards the language of the canonical utterance, with regards to lexical choice, (2) the generic grammar suggested cannot be used to generate all the queries we may want to support in a new domain, and (3) there is no check on the correctness or naturalness of the canonical utterances themselves, which may not be logically plausible
In order to examine the lexical diversity in the original dataset, we examine the ratio of the total number of word types seen in the natural language representations to the total number of token types in the meaning representation

Summary

Introduction

Semantic parsing is the task of mapping natural language utterances to their underlying meaning representations. This is an essential component for many tasks that require understanding natural language dialogue Orienting a dialogue-capable intelligent system is accomplished by training its semantic parser with utterances that capture the nuances of the domain. An inherent challenge lies in building datasets that have enough lexical diversity for granting the system robustness against natural language variation in query-based dialogue. With the advent of datadriven methods for semantic parsing (Dong and Lapata, 2016; Jia and Liang, 2016), constructing such realistic and sufficient-sized dialog datasets for specific domains becomes especially important, and is often the bottleneck for applying semantic parsers to new tasks

Objectives

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

How Would You Say It? Eliciting Lexically Diverse Dialogue for Supervised Semantic Parsing

Abstract

Highlights

Summary

Talk to us

Similar Papers

Lead the way for us

Publication Date: Jan 1, 2017
Citations: 25	License type: cc-by

Similar Papers

Unfreeze with Care: Space-Efficient Fine-Tuning of Semantic Parsing Models
Weiqi Sun ... Nicolas Guenon Des Mesnards
-
Weiqi Sun, et. al.Weiqi Sun ... Nicolas Guenon Des Mesnards
25 Apr 2022
25 Apr 2022

TaPas: Weakly Supervised Table Parsing via Pre-training
Jonathan Herzig ... Julian Eisenschlos
-
Jonathan Herzig, et. al.Jonathan Herzig ... Julian Eisenschlos
01 Jan 2020
01 Jan 2020

Data Recombination for Neural Semantic Parsing
Robin Jia ... Percy Liang
-
Robin Jia, et. al.Robin Jia ... Percy Liang
01 Jan 2015
01 Jan 2015

Training Naturalized Semantic Parsers with Very Little Data
Subendhu Rongali ... Konstantine Arkoudas
-
Subendhu Rongali, et. al.Subendhu Rongali ... Konstantine Arkoudas
01 Jul 2022
01 Jul 2022

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

How Would You Say It? Eliciting Lexically Diverse Dialogue for Supervised Semantic Parsing

Abstract

Highlights

Summary

Talk to us

Similar Papers