Learning with incomplete information and the mathematical structure behind it

Reimer Kühn,Ion-Olimpiu Stamatescu

doi:10.1007/s00422-007-0162-4

Abstract

We investigate the problem of learning with incomplete information as exemplified by learning with delayed reinforcement. We study a two phase learning scenario in which a phase of Hebbian associative learning based on momentary internal representations is supplemented by an 'unlearning' phase depending on a graded reinforcement signal. The reinforcement signal quantifies the success-rate globally for a number of learning steps in phase one, and 'unlearning' is indiscriminate with respect to associations learnt in that phase. Learning according to this model is studied via simulations and analytically within a student-teacher scenario for both single layer networks and, for a committee machine. Success and speed of learning depend on the ratio lambda of the learning rates used for the associative Hebbian learning phase and for the unlearning-correction in response to the reinforcement signal, respectively. Asymptotically perfect generalization is possible only, if this ratio exceeds a critical value lambda( c ), in which case the generalization error exhibits a power law decay with the number of examples seen by the student, with an exponent that depends in a non-universal manner on the parameter lambda. We find these features to be robust against a wide spectrum of modifications of microscopic modelling details. Two illustrative applications-one of a robot learning to navigate a field containing obstacles, and the problem of identifying a specific component in a collection of stimuli-are also provided.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learning with incomplete information and the mathematical structure behind it

Abstract

Talk to us

Similar Papers

More From: Biological Cybernetics

Lead the way for us

Journal: Biological Cybernetics	Publication Date: May 30, 2007
Citations: 4

Similar Papers

Contingency is Crucial for Creating Imitative Responses
C Catmur
Frontiers in Human Neuroscience | VOL. 5
C CatmurC Catmur
01 Jan 2010
Frontiers in Human Neuroscience | VOL. 5

From automaticity to control in bilinguals
Joseph Tzelgov ... Roi Cohen Kadosh
Trends in Cognitive Sciences | VOL. 13
Joseph Tzelgov, et. al.Joseph Tzelgov ... Roi Cohen Kadosh
10 Sep 2009
Trends in Cognitive Sciences | VOL. 13

TDCS Over the Motor Cortex Shows Differential Effects on Action and Object Words in Associative Word Learning in Healthy Aging.
Meret Branscheidt ... Gianpiero Liuzzi
Frontiers in Aging Neuroscience | VOL. 9
Meret Branscheidt, et. al.Meret Branscheidt ... Gianpiero Liuzzi
15 May 2017
Frontiers in Aging Neuroscience | VOL. 9

Effective neuronal learning with ineffective Hebbian learning rules.
Gal Chechik ... Isaac Meilijson
Neural Computation | VOL. 13
Gal Chechik, et. al.Gal Chechik ... Isaac Meilijson
01 Apr 2001
Neural Computation | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning with incomplete information and the mathematical structure behind it

Abstract

Talk to us

Similar Papers

More From: Biological Cybernetics