Abstract

Can advances in NLP help advance cognitive modeling? We examine the role of artificial neural networks, the current state of the art in many common NLP tasks, by returning to a classic case study. In 1986, Rumelhart and McClelland famously introduced a neural architecture that learned to transduce English verb stems to their past tense forms. Shortly thereafter in 1988, Pinker and Prince presented a comprehensive rebuttal of many of Rumelhart and McClelland’s claims. Much of the force of their attack centered on the empirical inadequacy of the Rumelhart and McClelland model. Today, however, that model is severely outmoded. We show that the Encoder-Decoder network architectures used in modern NLP systems obviate most of Pinker and Prince’s criticisms without requiring any simplification of the past tense mapping problem. We suggest that the empirical performance of modern networks warrants a reexamination of their utility in linguistic and cognitive modeling.

Highlights

  • In their famous 1986 opus, Rumelhart and McClelland (R&M) describe a neural network capable of transducing English verb stems to their past tense forms

  • The neural network approaches we advocate for achieve this goal, but do not clearly fall into either the single-route or dual-route category: the internal computations performed by each network remain opaque, so we cannot at present claim whether two separable computation paths are present

  • We evaluate the performance of the Encoder-Decoder (ED) network architecture in light of the criticisms P&P levied against the original R&M model


Summary

Introduction

In their famous 1986 opus, Rumelhart and McClelland (R&M) describe a neural network capable of transducing English verb stems to their past tense forms. State-of-the-art morphological generation networks used in NLP, built from the modern evolution of the recurrent neural networks (RNNs) explored by Elman (1990) and others, solve the same problem almost perfectly (Cotterell et al., 2016). This level of performance on a cognitively relevant problem suggests that it is time to consider further incorporating network modeling into the study of linguistics and cognitive science. We focus on an empirical assessment of the ability of a modern state-of-the-art neural architecture to learn linguistic patterns, asking the following questions: (i) Does the learner induce the full set of correct generalizations about the data? (ii) Does the learner acquire those generalizations in a way that resembles human acquisition? The results suggest that modern nets absolutely meet the first criterion, and often meet the second. They do this given limited prior knowledge of linguistic structure: the networks we consider do not have phonological features built into them and must instead learn their own representations for input phonemes.
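To make this modeling setup concrete, the following is a minimal, self-contained sketch of an encoder-decoder transducer of the general kind evaluated here. It is not the authors' exact architecture or data: the toy word pairs, hyperparameters, and use of orthographic characters (rather than phoneme strings) are our own illustrative assumptions. The key property it shares with the networks discussed above is that input symbol representations are learned embeddings, not hand-specified phonological features.

```python
# Toy character-level encoder-decoder (seq2seq) for stem -> past tense.
# Illustrative sketch only: word pairs, hyperparameters, and orthographic
# encoding are assumptions, not the paper's exact setup.
import torch
import torch.nn as nn

torch.manual_seed(0)
PAD, SOS, EOS = 0, 1, 2
pairs = [("walk", "walked"), ("jump", "jumped"), ("sing", "sang")]
chars = sorted({c for stem, past in pairs for c in stem + past})
stoi = {c: i + 3 for i, c in enumerate(chars)}  # ids 0-2 are reserved
itos = {i: c for c, i in stoi.items()}
vocab = len(stoi) + 3

def encode(word):
    return [stoi[c] for c in word] + [EOS]

class Seq2Seq(nn.Module):
    def __init__(self, vocab_size, dim=64):
        super().__init__()
        # Input representations are learned embeddings: no phonological
        # features are built in, mirroring the networks discussed above.
        self.emb = nn.Embedding(vocab_size, dim, padding_idx=PAD)
        self.enc = nn.LSTM(dim, dim, batch_first=True)
        self.dec = nn.LSTM(dim, dim, batch_first=True)
        self.out = nn.Linear(dim, vocab_size)

    def forward(self, src, tgt_in):
        _, state = self.enc(self.emb(src))        # encode the stem
        h, _ = self.dec(self.emb(tgt_in), state)  # teacher-forced decoding
        return self.out(h)

model = Seq2Seq(vocab)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss(ignore_index=PAD)

for epoch in range(300):  # tiny dataset, so the model simply memorizes it
    for stem, past in pairs:
        src = torch.tensor([encode(stem)])
        tgt = torch.tensor([[SOS] + encode(past)])
        logits = model(src, tgt[:, :-1])
        loss = loss_fn(logits.reshape(-1, vocab), tgt[:, 1:].reshape(-1))
        opt.zero_grad()
        loss.backward()
        opt.step()

def predict(stem, max_len=12):
    # Greedy decoding: feed the model's own prediction back in each step.
    src = torch.tensor([encode(stem)])
    _, state = model.enc(model.emb(src))
    tok, out = torch.tensor([[SOS]]), []
    for _ in range(max_len):
        h, state = model.dec(model.emb(tok), state)
        tok = model.out(h).argmax(-1)
        if tok.item() < 3:  # EOS or another reserved id: stop
            break
        out.append(itos[tok.item()])
    return "".join(out)

print(predict("walk"))  # expected "walked" once training has converged
```

In the full-scale setting, the same recipe is applied to phoneme strings over a large verb lexicon, typically with attention and larger hidden states; the greedy decoder above is only the simplest possible inference scheme.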

The English Past Tense
Acquisition of the Past Tense
Encoder-Decoder Architectures
Related Work
Non-neural Learners
Evaluation of the ED Learner
Experiment 1
Results and Discussion
Experiment 2
Results
Summary of Resolved and Outstanding Criticisms
Conclusion
