Constrained BERT BiLSTM CRF for understanding multi-sentence entity-seeking questions

Danish Contractor,Barun Patra,Parag Singla,Mausam Mausam

doi:10.1017/s1351324920000017

Danish Contractor, Barun Patra + Show 2 more

Open Access

https://doi.org/10.1017/s1351324920000017

Copy DOI

Abstract

AbstractWe present the novel task of understanding multi-sentenceentity-seekingquestions (MSEQs), that is, the questions that may be expressed in multiple sentences, and that expect one or more entities as an answer. We formulate the problem of understanding MSEQs as a semantic labeling task over an open representation that makes minimal assumptions about schema or ontology-specific semantic vocabulary. At the core of our model, we use a BiLSTM (bidirectional LSTM) conditional random field (CRF), and to overcome the challenges of operating with low training data, we supplement it by using BERT embeddings, hand-designed features, as well as hard and soft constraints spanning multiple sentences. We find that this results in a 12–15 points gain over a vanilla BiLSTM CRF. We demonstrate the strengths of our work using the novel task of answering real-world entity-seeking questions from the tourism domain. The use of our labels helps answer 36% more questions with 35% more (relative) accuracy as compared to baselines. We also demonstrate how our framework can rapidly enable the parsing of MSEQs in an entirely new domain with small amounts of training data and little change in the semantic representation.

Highlights

We are motivated by the goal of building an information agent for tourists – one that would perform various roles of a travel agent, such as helping decide the city to visit, recommending points of interest, finding travel routes, and even creating optimized itineraries
We present the first system for the novel task of answering entity-seeking MSRQs from background corpora in the tourism domain
Before describing implementation details of our question understanding component, we present some background on Constrained Conditional Models (CCMs)(Chang et al, 2007) and BiDiLSTM+CRF(Huang et al, 2015) as these are at the core of our question understanding component

Summary

Introduction

We are motivated by the goal of building an information agent for tourists – one that would perform various roles of a travel agent, such as helping decide the city to visit, recommending points of interest, finding travel routes, and even creating optimized itineraries. We focus our paper on questions that are entity-seeking, i.e., expect one or more entities as answer These include the large fraction of tourist questions that ask for hotels, restaurants, points of interest and other services that would serve a user’s specific needs the best. A preliminary analysis of such questions from popular tourism forums reveals that almost all of them contain multiple sentences – they often elaborate on a user’s specific situation before asking their question. We name these MSRQs – multi-sentence recommendation questions. The querying module needs to incorporate the various constructs found in recommendation questions

Contributions

Related Work

System Architecture

RQL Representation

Question Understanding

Background on Sequence Labeling

Semantic Labeling of Questions

Supervised Labeling Conditional Random Field

Crowd-sourced Data Collection

Semi-Supervised Labeling

Evaluation

Results

Answer Generation

Conclusion and Future Work

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Natural Language Engineering	Publication Date: Feb 13, 2020
Citations: 9	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Constrained BERT BiLSTM CRF for understanding multi-sentence entity-seeking questions

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Natural Language Engineering

Lead the way for us

Similar Papers

Semantic Language Model for Tunisian Dialect
Abir Masmoudi ... Rim Laatar
-
Abir Masmoudi, et. al.Abir Masmoudi ... Rim Laatar
22 Oct 2019
22 Oct 2019

Multilingual Speech Emotion Recognition on Japanese, English, and German
Panikos Heracleous ... Keiji Yasuda
-
Panikos Heracleous, et. al.Panikos Heracleous ... Keiji Yasuda
01 Jan 2023
01 Jan 2023

Gujarati Task Oriented Dialogue Slot Tagging Using Deep Neural Network Models
Rachana Parikh ... Hiren Joshi
-
Rachana Parikh, et. al.Rachana Parikh ... Hiren Joshi
01 Jan 2020
01 Jan 2020

CRF-based bibliography extraction from reference strings using a small amount of training data
Daiki Namikoshi ... Manabu Ohta
-
Daiki Namikoshi, et. al.Daiki Namikoshi ... Manabu Ohta
01 Sep 2017
01 Sep 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Constrained BERT BiLSTM CRF for understanding multi-sentence entity-seeking questions

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Natural Language Engineering