Abstract

Recent work on word ordering has argued that syntactic structure is important, or even required, for effectively recovering the order of a sentence. We find that, in fact, an n-gram language model with a simple heuristic gives strong results on this task. Furthermore, we show that a long short-term memory (LSTM) language model is even more effective at recovering order, with our basic model outperforming a state-of-the-art syntactic model by 11.5 BLEU points. Additional data and larger beams yield further gains, at the expense of training and search time.
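To make the decoding setup concrete, below is a minimal sketch of beam-search word ordering driven by a generic language-model scorer. All names here are illustrative assumptions, not the paper's implementation: the toy add-one-smoothed bigram scorer stands in for the trained n-gram or LSTM language models, and the sketch omits the future-cost heuristic used with the n-gram model.

```python
# Minimal sketch: beam-search linearization of a bag of words with a
# pluggable language-model scorer. The bigram scorer is a toy stand-in.
import math
from collections import Counter

def bigram_logprob(prev, word, bigram_counts, unigram_counts, vocab_size):
    """Add-one smoothed bigram log-probability (toy stand-in for a real LM)."""
    num = bigram_counts.get((prev, word), 0) + 1
    den = unigram_counts.get(prev, 0) + vocab_size
    return math.log(num / den)

def order_words(bag, score_next, beam_size=10):
    """Reorder a bag (multiset) of words by beam search.

    `score_next(prefix, word)` returns the LM log-probability of appending
    `word` to the partial sentence `prefix`.
    """
    # Each hypothesis: (log-prob so far, ordered prefix, remaining word counts)
    beam = [(0.0, [], Counter(bag))]
    for _ in range(len(bag)):
        candidates = []
        for logp, prefix, remaining in beam:
            for word in remaining:
                new_remaining = remaining.copy()
                new_remaining[word] -= 1
                if new_remaining[word] == 0:
                    del new_remaining[word]
                candidates.append((logp + score_next(prefix, word),
                                   prefix + [word], new_remaining))
        # Keep only the highest-scoring partial orderings.
        beam = sorted(candidates, key=lambda c: c[0], reverse=True)[:beam_size]
    return beam[0][1]
```

In this setup, `score_next` could wrap any left-to-right language model, for example `lambda prefix, w: bigram_logprob(prefix[-1] if prefix else "<s>", w, bigrams, unigrams, V)`; widening `beam_size` trades search time for output quality, mirroring the beam-size gains reported in the abstract.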

Highlights

  • We address the task of recovering the original word order of a shuffled sentence, referred to as bag generation (Brown et al., 1990), shake-and-bake generation (Brew, 1992), or, more recently, linearization; a recent line of research has standardized the task as a method for isolating the performance of text-to-text generation models (Zhang and Clark, 2011; Liu et al., 2015; Liu and Zhang, 2015; Zhang and Clark, 2015).

  • Selectional restrictions, subcategorization, and discourse considerations are among the many factors which join together to fix the order in which words occur... [T]here is an abstract structure which underlies the surface strings and it is this structure which provides a more insightful basis for understanding the constraints on word order... It is, therefore, an interesting question to ask whether a network can learn any aspects of that underlying abstract structure (Elman, 1990).

  • We find that language models are in general effective for linearization relative to existing syntactic approaches, with a long short-term memory (LSTM) model in particular outperforming the state of the art by 11.5 BLEU points; further gains are observed when training with additional text and decoding with larger beams.

Summary

Introduction

The predominant argument of the more recent work is that jointly recovering explicit syntactic structure is crucial for determining the correct word order of the original sentence. As such, these methods either generate, or rely on given, parse structures to reproduce the order. Elman judged the capacity of early recurrent neural networks in part by their ability to predict word order in simple sentences, noting: "The order of words in sentences reflects a number of constraints."
