Hierarchical Decision and Control for Continuous Multitarget Problem: Policy Evaluation With Action Delay.

Jiangcheng Zhu,Jun Zhu,Zhepei Wang,Shan Guo,Chao Xu

doi:10.1109/tnnls.2018.2844466

Abstract

This paper proposes a hierarchical decision-making and control algorithm for the shepherd game, the seventh mission in the International Aerial Robotics Competition (IARC). In this game, the agent (a multirotor aerial robot) is required to contact targets (ground vehicles) sequentially and drive them to a certain boundary to earn score. During the game of 10 min, the agent should be fully autonomous without any human interference. Regarding the lower-level controller and dynamics of the agent, each action takes a duration of time to accomplish. Denoted as an action delay, in this paper, this action duration is nonconstant and is related to the final reward. Therefore, the challenging point is making the agent "aware of time" when applying a certain action. We solve this problem by two approaches: deep Q-networks and lookup table. The action delay predictor in the decision-level is fitted by a lower-level controller. Through simulations by the example of the shepherd game, the effectiveness and efficiency of this approach are validated. This paper helps our team winning the first prize in IARC 2017, and keeps the best record of this mission since it was released in 2013.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Hierarchical Decision and Control for Continuous Multitarget Problem: Policy Evaluation With Action Delay.

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on neural networks and learning systems

Lead the way for us

Journal: IEEE transactions on neural networks and learning systems	Publication Date: Jul 2, 2018
Citations: 25

Similar Papers

Air-to-ground shepherd problem: An action-delay reinforcement learning approach
Jiangcheng Zhu ... Chao Xu
-
Jiangcheng Zhu, et. al.Jiangcheng Zhu ... Chao Xu
01 May 2017
01 May 2017

Dance of the Dragonfly: A Vision-Based Agile Aerial Touch Solution for IARC Mission 7
Ziliang Lai ... Wenjun Deng
-
Ziliang Lai, et. al.Ziliang Lai ... Wenjun Deng
01 Aug 2018
01 Aug 2018

Hierarchical Velocity Control Based on Differential Flatness for a DC/DC Buck Converter-DC Motor System
R Silva-Ortigoza ... F Carrizosa-Corral
Mathematical Problems in Engineering | VOL. 2014
R Silva-Ortigoza, et. al.R Silva-Ortigoza ... F Carrizosa-Corral
01 Jan 2014
Mathematical Problems in Engineering | VOL. 2014

A Real-Time 3D Path Planning Solution for Collision-Free Navigation of Multirotor Aerial Robots in Dynamic Environments
Jose Luis Sanchez-Lopez ... Holger Voos
Journal of Intelligent & Robotic Systems | VOL. 93
Jose Luis Sanchez-Lopez, et. al.Jose Luis Sanchez-Lopez ... Holger Voos
07 Apr 2018
Journal of Intelligent & Robotic Systems | VOL. 93

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Hierarchical Decision and Control for Continuous Multitarget Problem: Policy Evaluation With Action Delay.

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on neural networks and learning systems