Abstract
Neural networks have shown great potential in language modeling. Currently, the dominant approaches to language modeling are based on recurrent neural networks (RNNs) and convolutional neural networks (CNNs). Nonetheless, it is not clear why RNNs and CNNs are suitable for the language modeling task, since these neural models lack interpretability. The goal of this paper is to tailor an interpretable neural model as an alternative to RNNs and CNNs for the language modeling task. This paper proposes a unified framework for language modeling, which can partly interpret the rationales behind existing language models (LMs). Based on the proposed framework, an interpretable neural language model (INLM) is proposed, including an architectural structure and a learning method tailored to the language modeling task. The proposed INLM can be approximated as a parameterized auto-regressive moving average model and provides interpretability in two aspects: component interpretability and prediction interpretability. Experiments demonstrate that the proposed INLM outperforms several typical neural LMs on multiple language modeling datasets and on the Switchboard speech recognition task. Further experiments also show that the proposed INLM is competitive with state-of-the-art long short-term memory LMs on the Penn Treebank and WikiText-2 datasets.
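To make the ARMA connection concrete: a classical auto-regressive moving average model predicts the next value from a weighted sum of past outputs (the AR part) and past prediction errors (the MA part). The sketch below is a minimal illustration of one ARMA(p, q) prediction step, not the paper's INLM; the function name `arma_step` and its parameterization are hypothetical, chosen only to show the recurrence the abstract refers to.

```python
def arma_step(history, errors, phi, theta, c=0.0):
    """One prediction step of an ARMA(p, q) model:
        y_hat_t = c + sum_i phi[i] * y_{t-1-i} + sum_j theta[j] * e_{t-1-j}

    history: past outputs, most recent last (length >= len(phi))
    errors:  past prediction errors, most recent last (length >= len(theta))
    phi:     AR coefficients, phi[0] applies to the most recent output
    theta:   MA coefficients, theta[0] applies to the most recent error
    c:       constant (intercept) term
    """
    ar = sum(phi[i] * history[-(i + 1)] for i in range(len(phi)))
    ma = sum(theta[j] * errors[-(j + 1)] for j in range(len(theta)))
    return c + ar + ma


# Example: ARMA(2, 1) with outputs [1.0, 2.0] and errors [0.5, -0.5]
y_hat = arma_step([1.0, 2.0], [0.5, -0.5], phi=[0.5, 0.1], theta=[0.3])
# AR part: 0.5*2.0 + 0.1*1.0 = 1.1; MA part: 0.3*(-0.5) = -0.15
```

In a neural LM cast this way, the fixed scalar coefficients `phi` and `theta` would instead be parameterized (input-dependent) functions, which is what makes the individual components inspectable.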
Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing