Organization of the state space of a simple recurrent network before and after training on recursive linguistic structures

Michal Čerňanský, Ľubica Beňušková, Matej Makula

doi:10.1016/j.neunet.2006.01.020

Abstract

Recurrent neural networks are often employed in the cognitive science community to process symbol sequences that represent various natural language structures. The aim is to study possible neural mechanisms of language processing and to aid in the development of artificial language processing systems. We used data sets containing recursive linguistic structures and trained the Elman simple recurrent network (SRN) on the next-symbol prediction task. Concentrating on neuron activation clusters in the recurrent layer of the SRN, we investigate the organization of the network state space before and after training. Given an SRN and a training stream, we construct predictive models, called neural prediction machines, that directly employ the state space dynamics of the network. We demonstrate two important properties of representations of recursive symbol series in the SRN. First, the clusters of recurrent activations emerging before training are meaningful and correspond to Markov prediction contexts. We show that prediction states that naturally arise in the SRN initialized with small random weights approximately correspond to states of Variable Memory Length Markov Models (VLMMs) based on individual symbols (i.e. words). Second, we demonstrate that during training, the SRN reorganizes its state space according to word categories and their grammatical subcategories, and the next-symbol prediction is again based on the VLMM strategy. However, after training, the prediction is based on word categories and their grammatical subcategories rather than individual words. Our conclusion holds for small depths of recursion that are comparable to human performance. The methods of SRN training and analysis of its state space introduced in this paper are of a general nature and can be used to investigate the processing of any other symbol time series by means of an SRN.
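
The abstract does not spell out how a neural prediction machine is built, so the following is only a minimal sketch of the general idea under stated assumptions: an untrained Elman-style SRN with small random weights is run over a symbol stream, its recurrent activations are quantized with plain k-means, and each cluster gets a next-symbol distribution from counts. The function names (`srn_states`, `kmeans`, `build_npm`), the tanh activation, the hidden layer size, the weight scale, the choice of k-means, and the Laplace smoothing are all illustrative assumptions, not details taken from the paper.

```python
import math
import random

def srn_states(seq, n_symbols, n_hidden=8, scale=0.1, seed=0):
    """Run an untrained Elman-style SRN over seq; return the hidden state after each symbol."""
    rng = random.Random(seed)
    # Small random weights (assumed scale), as in the "before training" setting.
    w_in = [[rng.uniform(-scale, scale) for _ in range(n_symbols)] for _ in range(n_hidden)]
    w_rec = [[rng.uniform(-scale, scale) for _ in range(n_hidden)] for _ in range(n_hidden)]
    h = [0.0] * n_hidden
    states = []
    for s in seq:
        # One-hot input: only column s of w_in contributes to the net input.
        h = [math.tanh(w_in[i][s] + sum(w_rec[i][j] * h[j] for j in range(n_hidden)))
             for i in range(n_hidden)]
        states.append(h)
    return states

def nearest(point, centers):
    """Index of the center closest to point (squared Euclidean distance)."""
    return min(range(len(centers)),
               key=lambda c: sum((p - q) ** 2 for p, q in zip(point, centers[c])))

def kmeans(points, k, iters=20, seed=0):
    """Plain k-means; an assumed stand-in for the state space quantization step."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest(p, centers)].append(p)
        for c, g in enumerate(groups):
            if g:  # keep the old center if a cluster ends up empty
                centers[c] = [sum(col) / len(g) for col in zip(*g)]
    return centers

def build_npm(seq, n_symbols, k=4):
    """Cluster recurrent activations and estimate next-symbol probabilities per cluster."""
    states = srn_states(seq, n_symbols)
    centers = kmeans(states[:-1], k)  # the last state has no successor symbol
    counts = [[1.0] * n_symbols for _ in range(k)]  # Laplace smoothing (assumption)
    for t in range(len(seq) - 1):
        counts[nearest(states[t], centers)][seq[t + 1]] += 1
    probs = [[c / sum(row) for c in row] for row in counts]
    return centers, probs
```

Calling `build_npm(seq, n_symbols, k)` on an integer-coded symbol stream returns the cluster centers and one next-symbol distribution per cluster. Because the weights are small, each hidden state is dominated by the most recent symbols, which is the intuition behind the clusters acting as Markov prediction contexts.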

Journal: Neural Networks
Publication Date: May 9, 2006
Citations: 24

Similar Papers

Sequence Recognition with Recurrent Neural Networks
Arun Maskara ... Andrew Noetzel
Connection Science | VOL. 5
01 Jan 1992

Stack-like and queue-like dynamics in recurrent neural networks
André Grüning
Connection Science | VOL. 18
01 Mar 2006

Inference of Stochastic Regular Languages through Simple Recurrent Networks with Time Delays
Gustavo A Casañ ... M Asunción Castaño
01 Jan 2001

Simple recurrent networks as generalized hidden Markov models with distributed representations
Y Sakakibara ... M Golea
01 Dec 1995
