Online Learning of Robot Soccer Free Kick Plans Using a Bandit Approach

Juan Pablo Mendoza, Reid Simmons, Manuela Veloso

doi:10.1609/icaps.v26i1.13795

Abstract

This paper presents an online learning approach for teams of autonomous soccer robots to select free kick plans. In robot soccer, free kicks present an opportunity to execute plans with relatively controllable initial conditions. However, the effectiveness of each plan is highly dependent on the adversary, and there are few free kicks during each game, making it necessary to learn online from sparse observations. To make learning feasible, we first greatly reduce the planning space by framing the problem as a contextual multi-armed bandit problem, in which the actions are a set of pre-computed plans and the state is the position of the free kick on the field. During execution, we model the reward function for different free kicks using Gaussian Processes, and perform online learning using the Upper Confidence Bound algorithm. Results from a physics-based simulation show that the robots are capable of adapting to a variety of realistic opponents to maximize their expected reward during free kicks.
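
The selection loop described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the squared-exponential kernel, the hyperparameter values, the exploration weight `beta`, the zero prior mean, and the names `PlanRewardGP` and `select_plan` are all assumptions made for illustration. Each pre-computed plan gets its own Gaussian Process over free kick positions, and the plan with the highest upper confidence bound (posterior mean plus `beta` times posterior standard deviation) is chosen.

```python
import numpy as np


def rbf_kernel(a, b, length_scale, signal_var):
    """Squared-exponential kernel between point sets a (n, d) and b (m, d)."""
    sq_dist = (
        np.sum(a**2, axis=1)[:, None]
        + np.sum(b**2, axis=1)[None, :]
        - 2.0 * a @ b.T
    )
    return signal_var * np.exp(-0.5 * np.maximum(sq_dist, 0.0) / length_scale**2)


class PlanRewardGP:
    """GP regression of reward versus free kick position for one plan.

    Hyperparameters are illustrative assumptions, not values from the paper.
    """

    def __init__(self, length_scale=1.0, signal_var=1.0, noise_var=0.1):
        self.length_scale = length_scale
        self.signal_var = signal_var
        self.noise_var = noise_var
        self.positions = []  # observed free kick positions (x, y)
        self.rewards = []    # observed rewards for this plan

    def add_observation(self, position, reward):
        self.positions.append(np.asarray(position, dtype=float))
        self.rewards.append(float(reward))

    def predict(self, position):
        """Return the posterior (mean, standard deviation) at a position."""
        if not self.positions:
            # No data yet: fall back to the zero-mean prior.
            return 0.0, float(np.sqrt(self.signal_var))
        X = np.vstack(self.positions)
        y = np.array(self.rewards)
        x = np.asarray(position, dtype=float)[None, :]
        K = rbf_kernel(X, X, self.length_scale, self.signal_var)
        K = K + self.noise_var * np.eye(len(y))
        k_star = rbf_kernel(X, x, self.length_scale, self.signal_var)[:, 0]
        mean = k_star @ np.linalg.solve(K, y)
        var = self.signal_var - k_star @ np.linalg.solve(K, k_star)
        return float(mean), float(np.sqrt(max(var, 0.0)))


def select_plan(plan_gps, position, beta=2.0):
    """Pick the index of the plan with the highest upper confidence bound."""
    scores = []
    for gp in plan_gps:
        mean, std = gp.predict(position)
        scores.append(mean + beta * std)
    return int(np.argmax(scores))


# Usage: three hypothetical plans, one free kick at a hypothetical position.
plans = [PlanRewardGP() for _ in range(3)]
kick_position = (2.0, 1.5)
chosen = select_plan(plans, kick_position)
plans[chosen].add_observation(kick_position, reward=1.0)  # e.g. 1.0 if a goal was scored
```

In this sketch an untried plan keeps its full prior uncertainty, so it tends to be explored before the learner commits to a plan that has already paid off, which suits a setting with only a few free kicks per game.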

Journal: Proceedings of the International Conference on Automated Planning and Scheduling
Publication Date: Mar 30, 2016
Citations: 8

Similar Papers

How Expert Confidence Can Improve Collective Decision-Making in Contextual Multi-Armed Bandit Problems
Axel Abels ... Vito Trianni
01 Jan 2020

Robot Soccer: A Multi-Robot Challenge
Manuela M. Veloso
01 Jan 2002

Human behavior in contextual multi-armed bandit problems - BARCCSYN2015 presentation
...
22 Jun 2015

Human behavior in contextual multi-armed bandit problems - Poster presented at Reinforcement learning and decision making conference in 2015
...
14 Jun 2015
