Real-time distributed non-myopic task selection for heterogeneous robotic teams

Andrew J Smith, Javier Yu, Geoffrey A Hollinger, Graeme Best

doi:10.1007/s10514-018-9811-9

Abstract

In this paper, we introduce a novel algorithm for online, distributed, non-myopic task selection in heterogeneous robotic teams. The algorithm uses a temporal probabilistic representation that lets each robot evaluate its actions in the team’s joint action space while searching only its own action space. We use Monte Carlo tree search to asymmetrically search each robot’s individual action space while accounting for the probable future actions of its teammates through this condensed temporal representation. This allows a distributed team of robots to coordinate their actions non-myopically in real time. The method can be applied across a wide range of tasks, robot team compositions, and reward functions. To evaluate it, we ran a series of simulated trials and fielded hardware trials. In the simulated trials, our coordination method increased the cumulative team reward by up to 47.2% relative to a distributed auction-based coordination method. In several outdoor hardware trials with a team of three quadcopters, it increased the maximum cumulative reward by 24.5% relative to a distributed auction-based coordination method.
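The abstract does not spell out the algorithm, so the following Python sketch only illustrates the general idea under stated assumptions: a single robot runs Monte Carlo tree search over its own task sequences, and discounts each task's reward by the probability that a teammate will already have completed it. The task layout, the `TEAMMATE_CLAIMS` table, unit travel speed, and the UCB1 constant are all hypothetical choices made for this example, not the authors' implementation, and the distributed exchange of plans between robots is not modeled.

```python
import math
import random

# Hypothetical toy problem: task name -> (position on a line, reward).
TASKS = {"a": (1.0, 5.0), "b": (4.0, 8.0), "c": (6.0, 3.0)}

# Assumed condensed teammate model: per task, (time, probability) pairs meaning
# "a teammate will have completed this task by `time` with this probability".
TEAMMATE_CLAIMS = {"b": [(3.0, 0.9)]}


def claimed_prob(task, t):
    """Probability that some teammate has completed `task` by time t."""
    p_free = 1.0
    for claim_time, prob in TEAMMATE_CLAIMS.get(task, []):
        if claim_time <= t:
            p_free *= 1.0 - prob
    return 1.0 - p_free


def step(state, task):
    """Travel to `task` at unit speed; return next state and expected reward."""
    pos, t, done = state
    task_pos, reward = TASKS[task]
    t_next = t + abs(task_pos - pos)
    expected = reward * (1.0 - claimed_prob(task, t_next))
    return (task_pos, t_next, done | {task}), expected


def actions(state, horizon):
    """Tasks not yet done that can be reached before the time horizon."""
    pos, t, done = state
    return [k for k, (p, _) in TASKS.items()
            if k not in done and t + abs(p - pos) <= horizon]


class Node:
    def __init__(self, state):
        self.state = state
        self.children = {}  # task name -> (child Node, immediate reward)
        self.visits = 0
        self.total = 0.0    # sum of sampled returns from this node onward


def rollout(state, horizon, rng):
    """Random playout: accumulate expected reward until no task is reachable."""
    total = 0.0
    while True:
        acts = actions(state, horizon)
        if not acts:
            return total
        state, r = step(state, rng.choice(acts))
        total += r


def simulate(node, horizon, rng, c=5.0):
    """One MCTS iteration from `node`; returns the sampled return."""
    acts = actions(node.state, horizon)
    value = 0.0
    if acts:
        untried = [a for a in acts if a not in node.children]
        if untried:  # expansion followed by a random rollout
            a = rng.choice(untried)
            nxt, r = step(node.state, a)
            child = Node(nxt)
            node.children[a] = (child, r)
            tail = rollout(nxt, horizon, rng)
            child.visits, child.total = 1, tail
            value = r + tail
        else:  # UCB1 selection among fully expanded children
            def ucb(a):
                child, r = node.children[a]
                bonus = c * math.sqrt(math.log(node.visits) / child.visits)
                return r + child.total / child.visits + bonus
            a = max(acts, key=ucb)
            child, r = node.children[a]
            value = r + simulate(child, horizon, rng, c)
    node.visits += 1
    node.total += value
    return value


def select_task(start_pos, horizon, iterations=500, seed=0):
    """Return the first task with the highest mean sampled return."""
    rng = random.Random(seed)
    root = Node((start_pos, 0.0, frozenset()))
    for _ in range(iterations):
        simulate(root, horizon, rng)

    def mean_return(a):
        child, r = root.children[a]
        return r + child.total / child.visits
    return max(root.children, key=mean_return)


print(select_task(0.0, 10.0))
```

In this toy setup, task `b` has the largest raw reward, but the assumed teammate model says it will probably be taken before the robot can arrive, so the search prefers starting with `a`. A purely myopic, teammate-blind choice would pick `b`.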

Journal: Autonomous Robots
Publication Date: Nov 7, 2018
Citations: 12
