The Use of Apprenticeship Learning Via Inverse Reinforcement Learning for Generating Melodies

Orry M Messer, Pravesh Ranchod

doi:10.5281/zenodo.850991

Abstract

This paper uses apprenticeship learning via inverse reinforcement learning to ascertain a reward function in a musical context. The learning agent then uses this reward function to generate new melodies through reinforcement learning. Reinforcement learning is a type of machine learning in which rewards guide an agent’s learning. These rewards are usually specified manually, but in a musical setting this is difficult to do. In such cases, apprenticeship learning via inverse reinforcement learning can recover a reward function from examples of expert behaviour. In this research, melodies generated by the authors served as the expert behaviour: the learning agent inferred a reward function from them and then used it to generate new melodies. The paper is presented as a proof of concept. The results show that the approach can generate new melodies, although further work is needed to build upon the rudimentary learning agent presented here.
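To make the idea of recovering a reward from expert melodies concrete, the following is a minimal sketch and not the authors' implementation. It assumes a reward that is linear in hand-picked features, as in the standard formulation of apprenticeship learning via inverse reinforcement learning, and it shows only the first step of the projection method, where the weights are the difference between the expert's and the agent's discounted feature expectations. The interval-based features, the discount factor, the example melodies, and all function names are hypothetical choices made for illustration; the abstract does not specify them.

```python
from typing import List, Sequence

# Hypothetical feature set: absolute interval of 0..11 semitones, plus one
# bucket for anything of 12 semitones or more.
N_FEATURES = 13


def features(prev_pitch: int, pitch: int) -> List[float]:
    """One-hot encoding of the absolute interval between two MIDI pitches."""
    phi = [0.0] * N_FEATURES
    phi[min(abs(pitch - prev_pitch), N_FEATURES - 1)] = 1.0
    return phi


def feature_expectations(melodies: Sequence[Sequence[int]],
                         gamma: float = 0.9) -> List[float]:
    """Empirical discounted feature expectations, averaged over melodies."""
    mu = [0.0] * N_FEATURES
    for melody in melodies:
        for t in range(1, len(melody)):
            phi = features(melody[t - 1], melody[t])
            discount = gamma ** (t - 1)
            for i in range(N_FEATURES):
                mu[i] += discount * phi[i]
    return [m / len(melodies) for m in mu]


def initial_reward_weights(mu_expert: List[float],
                           mu_agent: List[float]) -> List[float]:
    """First projection-method step: w = mu_expert - mu_agent."""
    return [e - a for e, a in zip(mu_expert, mu_agent)]


def reward(w: List[float], prev_pitch: int, pitch: int) -> float:
    """Linear reward w . phi for moving from prev_pitch to pitch."""
    return sum(wi * fi for wi, fi in zip(w, features(prev_pitch, pitch)))


# Made-up data: stepwise "expert" melodies and a leaping baseline agent.
expert_melodies = [[60, 62, 64, 65, 67], [67, 65, 64, 62, 60]]
agent_melodies = [[60, 72, 61, 73, 62]]

mu_expert = feature_expectations(expert_melodies)
mu_agent = feature_expectations(agent_melodies)
w = initial_reward_weights(mu_expert, mu_agent)
```

With these made-up inputs, the learned weights favour the small steps that the expert melodies use and penalise the large leaps of the baseline. In a full apprenticeship learning loop, a reinforcement learning agent would be trained on this reward, its feature expectations would be recomputed, and the weights would be updated until the agent's feature expectations approach the expert's.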
