Abstract

Despite the tremendous empirical success of neural models in natural language processing, many of them lack the strong intuitions that accompany classical machine learning approaches. Recently, connections have been shown between convolutional neural networks (CNNs) and weighted finite state automata (WFSAs), leading to new interpretations and insights. In this work, we show that some recurrent neural networks also share this connection to WFSAs. We characterize this connection formally, defining rational recurrences to be recurrent hidden state update functions that can be written as the Forward calculation of a finite set of WFSAs. We show that several recent neural models use rational recurrences. Our analysis provides a fresh view of these models and facilitates devising new neural architectures that draw inspiration from WFSAs. We present one such model, which performs better than two recent baselines on language modeling and text classification. Our results demonstrate that transferring intuitions from classical models like WFSAs can be an effective approach to designing and understanding neural models.
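
To make the definition above concrete, here is a minimal sketch of the Forward calculation of a single WFSA, written as a recurrent hidden-state update. The names (wfsa_forward, transition, initial) and the toy weights are illustrative, not from the paper:

    import numpy as np

    def wfsa_forward(initial, transition, tokens):
        """Forward calculation of a WFSA, phrased as a recurrent update.

        initial:    length-k vector of start weights, one per WFSA state
        transition: dict mapping each token to a (k x k) matrix of arc weights
        The hidden state h_t holds, for each WFSA state, the total weight
        of all paths reaching that state after reading tokens[:t].
        """
        h = initial
        for x in tokens:
            h = h @ transition[x]  # recurrent update: h_t = h_{t-1} A[x_t]
        return h

    # toy example: a 2-state WFSA over a binary vocabulary
    A = {0: np.array([[1.0, 0.5], [0.0, 1.0]]),
         1: np.array([[1.0, 0.0], [0.0, 0.9]])}
    print(wfsa_forward(np.array([1.0, 0.0]), A, [0, 1, 1]))  # [1.0, 0.405]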

Highlights

  • Neural models, and in particular gated variants of recurrent neural networks (RNNs, e.g., Hochreiter and Schmidhuber, 1997; Cho et al., 2014), have become a core building block for state-of-the-art approaches in NLP (Goldberg, 2016).

  • In this work we show that many neural models are more interpretable than previously thought.

  • We present a new model motivated by the interpolation of a two-state weighted finite-state automaton (WFSA) and a three-state one, capturing unigram and bigram features, respectively (a rough sketch of these recurrences follows this list).
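
As an illustration of the last highlight, the following sketch shows one way the element-wise recurrences of such unigram and bigram WFSAs can be written. The gate and weight names (f, u, v) are hypothetical placeholders, and the paper's actual parameterization may differ:

    def unigram_step(c, u, f):
        # Forward step of a 2-state WFSA: the accepting state accumulates
        # single-token evidence u_t, decayed by a self-loop weight f_t.
        return f * c + u

    def bigram_step(c1, c2, u, v, f1, f2):
        # Forward step of a 3-state WFSA, i.e. the update induced by the
        # per-token transition matrix [[1, u, 0], [0, f1, v], [0, 0, f2]]:
        # state 1 starts a bigram with weight u_t; state 2 extends a
        # partial match with a second token of weight v_t.
        new_c1 = f1 * c1 + u
        new_c2 = f2 * c2 + c1 * v
        return new_c1, new_c2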


Summary

Introduction

Neural models, and in particular gated variants of recurrent neural networks (RNNs, e.g., Hochreiter and Schmidhuber, 1997; Cho et al., 2014), have become a core building block for state-of-the-art approaches in NLP (Goldberg, 2016). While these models empirically outperform classical NLP methods on many tasks (Zaremba et al., 2014; Bahdanau et al., 2015; Dyer et al., 2016; Peng et al., 2017, inter alia), they typically lack the intuition offered by classical models, making it hard to understand the roles played by each of their components. We study several recently proposed RNN architectures and show that one can use WFSAs to characterize their recurrent updates; we call such models rational recurrences (§3).
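
To make this characterization concrete, the scalar toy below shows one such correspondence: a gated cell update of the form c_t = f_t * c_{t-1} + i_t * x_t is exactly the Forward step of a two-state WFSA. This is a minimal sketch in the spirit of the paper's constructions, not the paper's own code:

    import numpy as np

    def gated_step(c, x, f, i):
        # a simplified gated RNN cell update (scalar, for illustration)
        return f * c + i * x

    def wfsa_step(h, x, f, i):
        # Forward step of a 2-state WFSA whose transition matrix for token x is
        #   [[1, i*x],
        #    [0, f  ]]
        A = np.array([[1.0, i * x],
                      [0.0, f]])
        return h @ A

    c, h = 0.0, np.array([1.0, 0.0])
    for x, f, i in [(0.3, 0.9, 0.5), (1.2, 0.7, 0.4)]:
        c = gated_step(c, x, f, i)
        h = wfsa_step(h, x, f, i)
    print(c, h[1])  # both print 0.585: the accepting-state weight is the cell

The two traces agree at every step, since the accepting state's path weight obeys the same recurrence as the gated cell.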
