Abstract

With the rapid growth of AI technology across Natural Language Processing applications, demands on the fields of Natural Language Understanding and Natural Language Generation have been increasing rapidly as well. Both of these techniques analyze language as it is naturally spoken or written by users, and must contend with a degree of ambiguity not present in formal language. For that reason, Language Modeling plays a key role in this area. Recently, the emerging field of deep learning, which applies deep neural networks to machine learning tasks, has been applied to language modeling. Long Short-Term Memory (LSTM), a type of recurrent neural network, has been adopted and has achieved better results than traditional language models. However, although LSTM-based language models achieve reasonable results by memorizing the values of preceding cells, it is difficult for them to retain all the information of preceding cells because they use only a single 2-dimensional matrix as memory. To compensate for this limitation, we propose a method of sharing cell states in a neural network-based language model, which treats all preceding cell states as a cell-stack. Our model achieved better performance than a traditional LSTM-based language model, improving average perplexity from 133.88 to 124.32 across various time steps and from 141.29 to 133.62 across various hidden sizes, respectively.
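To make the contrast concrete, the following is a minimal sketch of the idea: a standard LSTM step keeps only the most recent cell state as memory, while a cell-stack variant keeps all preceding cell states and pools over them. The mean pooling used here is an illustrative assumption, not the paper's actual combination mechanism, and all function names are hypothetical.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x, h, c, W):
    """One standard LSTM step: the single cell matrix c is the only memory."""
    z = np.concatenate([x, h])          # combined input and previous hidden state
    i = sigmoid(W["i"] @ z)             # input gate
    f = sigmoid(W["f"] @ z)             # forget gate
    o = sigmoid(W["o"] @ z)             # output gate
    g = np.tanh(W["g"] @ z)             # candidate cell values
    c_new = f * c + i * g               # overwrite-style memory update
    h_new = o * np.tanh(c_new)
    return h_new, c_new

def cell_stack_step(x, h, cell_stack, W):
    """Cell-stack variant (sketch): condition the step on a summary of ALL
    preceding cell states, then push the new state onto the stack.
    Mean pooling is an assumption for illustration only."""
    c_shared = np.mean(cell_stack, axis=0)
    h_new, c_new = lstm_step(x, h, c_shared, W)
    cell_stack.append(c_new)
    return h_new, cell_stack
```

Because every preceding cell state stays on the stack, information from early time steps is no longer forced through a single repeatedly overwritten matrix, which is the limitation the abstract describes.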
