Recurrent Neural Network based language modeling with controllable external Memory

Wei-Jen Ko, Bo-Hsiang Tseng, Hung-Yi Lee

doi:10.1109/icassp.2017.7953249

Abstract

It is crucial for language models to capture long-term dependencies in word sequences, which recurrent neural network (RNN) based language models with long short-term memory (LSTM) units achieve to a reasonable extent. To accurately model the sophisticated long-term information in human languages, language models need large memory. However, the size of RNN-based language models cannot be increased arbitrarily: because of the structure of these models, the required computational resources and the model complexity increase accordingly. To overcome this problem, inspired by the Neural Turing Machine and the Memory Network, we equip RNN-based language models with controllable external memory. With a learnable memory controller, the size of the external memory is independent of the number of model parameters, so the proposed language model can have larger memory without increasing the number of parameters. In the experiments, the proposed model yielded lower perplexities than RNN-based language models with LSTM units on both English and Chinese corpora.
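The abstract's claim that memory size is decoupled from parameter count can be illustrated with a minimal sketch. The abstract does not describe the controller's internals, so the code below assumes a generic content-based read in the style of the Neural Turing Machine; it is not the authors' model. All names (`ReadController`, `W_key`, `sharpness`, `make_memory`) and all sizes are hypothetical.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cosine(u, v):
    """Cosine similarity with a small epsilon to avoid division by zero."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv + 1e-8)


class ReadController:
    """Hypothetical read controller (an assumption, not the paper's design).

    It projects the RNN hidden state to a key and reads memory by content.
    """

    def __init__(self, hidden_size, slot_width, seed=0):
        rng = random.Random(seed)
        # The only weights: a slot_width x hidden_size projection.
        # Nothing here depends on how many memory slots exist.
        self.W_key = [[rng.uniform(-0.1, 0.1) for _ in range(hidden_size)]
                      for _ in range(slot_width)]
        self.sharpness = 5.0  # scalar that sharpens the attention weights

    def num_parameters(self):
        return sum(len(row) for row in self.W_key) + 1

    def read(self, hidden, memory):
        # Key emitted from the hidden state.
        key = [sum(w * h for w, h in zip(row, hidden)) for row in self.W_key]
        # One attention weight per memory slot, based on similarity to the key.
        weights = softmax([self.sharpness * cosine(key, slot) for slot in memory])
        # Read vector: attention-weighted sum of the slots.
        width = len(memory[0])
        read_vec = [sum(w * slot[j] for w, slot in zip(weights, memory))
                    for j in range(width)]
        return read_vec, weights


def make_memory(num_slots, slot_width, seed=1):
    """Random memory contents, standing in for whatever a writer would store."""
    rng = random.Random(seed)
    return [[rng.gauss(0, 1) for _ in range(slot_width)] for _ in range(num_slots)]


controller = ReadController(hidden_size=8, slot_width=4)
hidden = [0.5] * 8
small_read, small_w = controller.read(hidden, make_memory(16, 4))
large_read, large_w = controller.read(hidden, make_memory(256, 4))
```

The same controller reads from 16 slots and from 256 slots without any change to its weights, which is the property the abstract describes. Under these assumptions only the per-step compute grows with the number of slots, not the parameter count.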

Similar Papers

A New Delay Connection for Long Short-Term Memory Networks.
Jianyong Wang ... Lei Zhang
International Journal of Neural Systems | VOL. 28
24 Jun 2018

Wind Power Prediction Based on Recurrent Neural Network with Long Short-Term Memory Units
Danting Dong ... Tiancheng Yang
01 Nov 2018

STANN: A Spatio–Temporal Attentive Neural Network for Traffic Prediction
Zhixiang He ... Chi-Yin Chow
IEEE Access | VOL. 7
01 Jan 2019

Language Model Using Neural Turing Machine Based on Localized Content-Based Addressing
Donghyun Lee ... Jeong-Sik Park
Applied Sciences | VOL. 10
15 Oct 2020
