Abstract

We show how the spellings of known words can help us deal with unknown words in open-vocabulary NLP tasks. The method we propose can be used to extend any closed-vocabulary generative model, but in this paper we specifically consider the case of neural language modeling. Our Bayesian generative story combines a standard RNN language model (generating the word tokens in each sentence) with an RNN-based spelling model (generating the letters in each word type). These two RNNs respectively capture sentence structure and word structure, and are kept separate as in linguistics. By invoking the second RNN to generate spellings for novel words in context, we obtain an open-vocabulary language model. For known words, embeddings are naturally inferred by combining evidence from type spelling and token context. Comparing to baselines (including a novel strong baseline), we beat previous work and establish state-of-the-art results on multiple datasets.
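To make the two-level architecture concrete, the sketch below (not the authors' code) shows one way to pair a word-level RNN language model with a character-level "speller" RNN in PyTorch: a known word is scored directly from the word-level softmax, while a novel word is scored as the probability of an unknown-word event in context times the speller's probability of its spelling. Class and function names (SpellerRNN, WordRNNLM, token_log_prob) and the <unk>/end-of-word conventions are illustrative assumptions; the full Bayesian story in the paper, with type-level spelling generation and embedding inference for known words, is not reproduced here.

```python
# Minimal sketch of a two-level open-vocabulary language model, assuming PyTorch.
# This illustrates the scoring idea only; it is not the paper's implementation.
import torch
import torch.nn as nn

class SpellerRNN(nn.Module):
    """Character-level RNN: assigns a probability to the spelling of a word type."""
    def __init__(self, n_chars, dim=128):
        super().__init__()
        self.emb = nn.Embedding(n_chars, dim)
        self.rnn = nn.LSTM(dim, dim, batch_first=True)
        self.out = nn.Linear(dim, n_chars)

    def log_prob(self, char_ids):
        # char_ids: (1, T) character indices, ending in an end-of-word symbol.
        x = self.emb(char_ids[:, :-1])
        h, _ = self.rnn(x)
        logp = torch.log_softmax(self.out(h), dim=-1)
        # Sum the log-probabilities of each next character given the prefix.
        return logp.gather(-1, char_ids[:, 1:].unsqueeze(-1)).sum()

class WordRNNLM(nn.Module):
    """Word-level RNN LM over a closed vocabulary plus an <unk> event."""
    def __init__(self, vocab_size, dim=256):
        super().__init__()
        self.emb = nn.Embedding(vocab_size, dim)
        self.rnn = nn.LSTM(dim, dim, batch_first=True)
        self.out = nn.Linear(dim, vocab_size)

    def next_log_probs(self, prefix_ids):
        # prefix_ids: (1, L) word indices for the sentence so far.
        h, _ = self.rnn(self.emb(prefix_ids))
        return torch.log_softmax(self.out(h[:, -1]), dim=-1)

def token_log_prob(word_lm, speller, prefix_ids, word_id, unk_id, char_ids=None):
    """Open-vocabulary token score: known words use the word-level softmax;
    a novel word pays for the <unk> event plus its spelling under the speller."""
    logp = word_lm.next_log_probs(prefix_ids)
    if word_id != unk_id:
        return logp[0, word_id]
    return logp[0, unk_id] + speller.log_prob(char_ids)
```

Keeping the speller separate from the sentence-level RNN, as the abstract describes, means the same spelling model can score any out-of-vocabulary word regardless of where it appears in the sentence.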
