Abstract

The standard recurrent neural network language model (RNNLM) generates sentences one word at a time and does not work from an explicit global sentence representation. In this work, we introduce and study an RNN-based variational autoencoder generative model that incorporates distributed latent representations of entire sentences. This factorization allows it to explicitly model holistic properties of sentences such as style, topic, and high-level syntactic features. Samples from the prior over these sentence representations remarkably produce diverse and well-formed sentences through simple deterministic decoding. By examining paths through this latent space, we are able to generate coherent novel sentences that interpolate between known sentences. We present techniques for solving the difficult learning problem presented by this model, demonstrate its effectiveness in imputing missing words, explore many interesting properties of the model's latent sentence space, and present negative results on the use of the model in language modeling.
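
The interpolation result mentioned in the abstract can be made concrete with a short sketch: encode two sentences to their latent codes, walk along the straight line between them, and decode each intermediate point deterministically. The names encode_to_mean and greedy_decode are hypothetical stand-ins for a trained model's encoder and decoder, not part of any published implementation.

```python
# Minimal sketch of latent-space interpolation between two sentences.
# encode_to_mean and greedy_decode are hypothetical callables standing in
# for a trained sentence VAE's encoder (posterior mean) and decoder.
import numpy as np

def interpolate_sentences(encode_to_mean, greedy_decode, sentence_a, sentence_b, steps=5):
    """Decode evenly spaced points on the line between two latent codes."""
    z_a = encode_to_mean(sentence_a)
    z_b = encode_to_mean(sentence_b)
    outputs = []
    for t in np.linspace(0.0, 1.0, steps):
        z = (1.0 - t) * z_a + t * z_b      # point on the path between the two codes
        outputs.append(greedy_decode(z))   # simple deterministic decoding
    return outputs
```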

Highlights

  • Recurrent neural network language models represent the state of the art in unsupervised generative modeling for natural language sentences

  • Drawing inspiration from recent successes in modeling images (Gregor et al., 2015), handwriting, and natural speech (Chung et al., 2015), our model circumvents these difficulties using the architecture of a variational autoencoder and takes advantage of recent advances in variational inference (Kingma and Welling, 2015; Rezende et al., 2014) that introduce a practical training technique for powerful neural network generative models with latent variables (see the sketch below this list)
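
A minimal sketch of the training objective the last highlight refers to, assuming a Gaussian posterior with diagonal covariance and a standard-normal prior: the decoder's reconstruction loss is combined with a KL penalty, and the latent code is sampled with the reparameterization trick so gradients flow through the sampling step. This is an illustrative PyTorch fragment, not the authors' code.

```python
# Sketch of the variational-autoencoder objective (negative ELBO) with the
# reparameterization trick; illustrative only, not the paper's implementation.
import torch

def reparameterize(mu, logvar):
    """Sample z = mu + sigma * eps with eps ~ N(0, I), keeping gradients."""
    std = torch.exp(0.5 * logvar)
    return mu + std * torch.randn_like(std)

def negative_elbo(reconstruction_nll, mu, logvar):
    """Reconstruction term plus KL(q(z|x) || N(0, I)), averaged over the batch."""
    kl = -0.5 * torch.sum(1.0 + logvar - mu.pow(2) - logvar.exp(), dim=-1)
    return reconstruction_nll + kl.mean()
```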

Summary

Introduction

Recurrent neural network language models (RNNLMs; Mikolov et al., 2011) represent the state of the art in unsupervised generative modeling for natural language sentences. RNNLM decoders conditioned on task-specific features are the state of the art in tasks like machine translation (Sutskever et al., 2014; Bahdanau et al., 2015) and image captioning (Vinyals et al., 2015; Mao et al., 2015; Donahue et al., 2015). The RNNLM generates sentences word-by-word based on an evolving distributed state representation, which makes it a probabilistic model with no significant independence assumptions. A standard RNN language model predicts each word of a sentence conditioned on the previous word and an evolving hidden state. While effective, it does not learn a vector representation of the full sentence. In the case of a sequence autoencoder, both the encoder and the decoder are RNNs and the training examples are token sequences.
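
The architecture described above can be sketched as follows: an RNN encoder summarizes the token sequence into a Gaussian posterior over a sentence code z, and an RNN language-model decoder predicts each word conditioned on the previous words and an initial hidden state derived from z. Dimensions, GRU cells, and layer names here are assumptions made for brevity; the paper's model differs in detail (it uses LSTM cells, for example).

```python
# Minimal sketch, under assumed dimensions, of an RNN-based sentence VAE:
# an RNN encoder produces a Gaussian posterior over a sentence code z, and an
# RNN language-model decoder is conditioned on z via its initial hidden state.
# Illustrative reconstruction, not the authors' exact implementation.
import torch
import torch.nn as nn

class SentenceVAE(nn.Module):
    def __init__(self, vocab_size, embed_dim=256, hidden_dim=512, latent_dim=32):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.encoder = nn.GRU(embed_dim, hidden_dim, batch_first=True)
        self.to_mu = nn.Linear(hidden_dim, latent_dim)      # posterior mean
        self.to_logvar = nn.Linear(hidden_dim, latent_dim)  # posterior log-variance
        self.z_to_h = nn.Linear(latent_dim, hidden_dim)     # condition decoder on z
        self.decoder = nn.GRU(embed_dim, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, vocab_size)

    def forward(self, tokens):
        x = self.embed(tokens)                    # (batch, seq, embed_dim)
        _, h = self.encoder(x)                    # final hidden state summarizes the sentence
        mu, logvar = self.to_mu(h[-1]), self.to_logvar(h[-1])
        z = mu + torch.exp(0.5 * logvar) * torch.randn_like(mu)  # reparameterized sample
        h0 = torch.tanh(self.z_to_h(z)).unsqueeze(0)             # decoder's initial state
        dec_out, _ = self.decoder(x, h0)          # word-by-word prediction conditioned on z
        return self.out(dec_out), mu, logvar      # logits plus posterior parameters
```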
