Abstract

Although there is extensive work on deep reinforcement learning (DRL) for robotics, sequential trajectory generation for multiprocess robotic tasks based on DRL is yet to be explored. In this article, the multiprocess task is formulated as a Markov decision process, and a nested dual-memory deep deterministic policy gradient algorithm with dynamic criteria is proposed to generalize traditional trajectory planning with a predefined target point into a trajectory exploration problem aimed at a target area, without solving inverse kinematics. First, a dual-memory architecture with a local-to-global strategy is introduced to enhance performance. Second, a novel nested architecture is proposed to generate sequential trajectory segments successively and asynchronously for the multiprocess task. Third, a compound reward system is designed, and a weight coefficient matrix is adopted to balance position control and orientation control based on Tait–Bryan angles. In addition, a virtual twin system is established to improve training efficiency, so that trajectories generated in simulation can be directly applied to the real physical platform. Finally, experimental results on both simulated and real-world applications verify the performance of the proposed approach.
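To make the compound reward concrete, the sketch below shows one plausible reading of a weight coefficient matrix balancing position error against Tait–Bryan orientation error. This is a minimal illustration, not the authors' implementation: the function name `compound_reward`, the quadratic error form, and the specific weight values are all assumptions for demonstration.

```python
import numpy as np

def compound_reward(pos, pos_target, tb_angles, tb_target, W):
    """Hypothetical compound reward balancing position and orientation control.

    pos, pos_target      -- end-effector position (x, y, z), metres
    tb_angles, tb_target -- Tait-Bryan angles (roll, pitch, yaw), radians
    W                    -- 6x6 weight coefficient matrix trading off position
                            error against orientation error (assumed diagonal)
    """
    # Stack position and orientation errors into a single 6-D error vector.
    e = np.concatenate([pos - pos_target, tb_angles - tb_target])
    # Negative weighted quadratic error: the reward increases as the end
    # effector approaches the target area in both position and orientation.
    return -float(e @ W @ e)

# Example usage: weight position errors more heavily than orientation errors.
W = np.diag([1.0, 1.0, 1.0, 0.2, 0.2, 0.2])
r = compound_reward(np.array([0.30, 0.10, 0.25]),
                    np.array([0.32, 0.10, 0.24]),
                    np.array([0.0, 0.1, 0.0]),
                    np.zeros(3),
                    W)
```

Under this reading, scaling the orientation weights relative to the position weights is what lets a single scalar reward steer both aspects of the end-effector pose toward the target area.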
