Abstract

This paper presents a music recommendation system for the offline libraries of songs that employs the concepts of reinforcement learning to obtain satisfactory recommendations based on the various considered content-based parameters. In order to obtain insights about the effectiveness of the generated recommendations, parallel instances of single-play multi-arm bandit algorithms are maintained. In conjunction to this, the concepts of Bayesian learning are considered to model the user preferences, by assuming the environment’s reward generating process to be non-stationary and stochastic. The system is designed to be simple, easy to implement, and at-par with the user satisfaction, within the bounds of the input data capabilities.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call