Learning the sound inventory of a complex vocal skill via an intrinsic reward.

Hazem Toutounji, Anja T Zai, Dina Lipkind, Richard H R Hahnloser, Ofer Tchernichovski

doi:10.1126/sciadv.adj3824

Abstract

Reinforcement learning (RL) is thought to underlie the acquisition of vocal skills like birdsong and speech, where sounding like one's "tutor" is rewarding. However, what RL strategy generates the rich sound inventories for song or speech? We find that the standard actor-critic model of birdsong learning fails to explain juvenile zebra finches' efficient learning of multiple syllables. When we replace the single actor with multiple independent actors that jointly maximize a common intrinsic reward, the model accurately reproduces the birds' empirical learning trajectories. The influence of each actor (syllable) on the magnitude of the global reward is competitively determined by its acoustic similarity to target syllables. This leads to each actor matching the target it is closest to and, occasionally, to the competitive exclusion of an actor from the learning process (i.e., from the learned song). We propose that a competitive-cooperative multi-actor RL (MARL) algorithm is key for the efficient learning of the action inventory of a complex skill.
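
The multi-actor idea in the abstract can be illustrated with a toy sketch. This is not the authors' model: the 2-D "acoustic" feature space, the squared-distance reward, the hard nearest-actor rule, the noise-free baseline, and all names and parameter values (`global_reward`, `train`, `sigma`, `lr`) are assumptions made for illustration only. Each actor perturbs its own syllable, every actor receives the same scalar reward, and each target contributes to that reward only through the rendition closest to it.

```python
import random


def global_reward(renditions, targets):
    """Shared scalar reward: each target is scored only by its closest rendition."""
    total = 0.0
    for t in targets:
        total -= min(sum((x - y) ** 2 for x, y in zip(r, t)) for r in renditions)
    return total


def train(actors, targets, trials=3000, sigma=0.05, lr=0.02, seed=0):
    """Perturbation-based learning with one reward shared by all actors."""
    rng = random.Random(seed)
    actors = [list(a) for a in actors]  # copy so the caller's data is untouched
    for _ in range(trials):
        # Every actor independently explores around its current syllable.
        noise = [[rng.gauss(0.0, sigma) for _ in a] for a in actors]
        sung = [[m + n for m, n in zip(a, nz)] for a, nz in zip(actors, noise)]
        # Assumed baseline: the reward of the unperturbed syllables.
        advantage = global_reward(sung, targets) - global_reward(actors, targets)
        # Each actor correlates the common reward change with its own noise.
        for a, nz in zip(actors, noise):
            for k in range(len(a)):
                a[k] += lr * advantage * nz[k] / sigma ** 2
    return actors


targets = [(0.0, 0.0), (2.0, 0.0), (1.0, 2.0)]  # hypothetical tutor syllables
start = [(0.5, 0.5), (1.6, 0.4), (1.0, 1.4)]    # hypothetical juvenile syllables
learned = train(start, targets)
print(global_reward(start, targets), global_reward(learned, targets))
```

In this sketch an actor that is never the closest rendition to any target receives no systematic learning signal, which is one simple way to picture the competitive exclusion the abstract mentions.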

Journal: Science Advances
Publication Date: Mar 29, 2024
License type: cc-by-nc

