Abstract

Deep reinforcement learning (DRL) algorithms have been demonstrated to be effective on a wide range of challenging decision-making and control tasks. However, these methods typically suffer from severe action oscillations, particularly in discrete action settings: agents select different actions within consecutive steps even though the states differ only slightly. This issue is often neglected because the quality of a policy is usually evaluated using cumulative rewards alone. Action oscillation strongly degrades the user experience and can even pose serious safety hazards, especially in real-world domains where safety is a primary concern, such as autonomous driving. In this paper, we introduce the Policy Inertia Controller (PIC), which serves as a generic plug-in framework for off-the-shelf DRL algorithms and enables an adaptive balance between optimality and smoothness in a formal way. We propose Nested Policy Iteration as a general training algorithm for PIC-augmented policies, which ensures monotonically non-decreasing updates. Further, we derive a practical DRL algorithm, namely Nested Soft Actor-Critic. Experiments on a collection of autonomous driving tasks and several Atari games suggest that our approach achieves substantial oscillation reduction compared with a range of commonly adopted baselines, with almost no performance degradation.
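To make the idea of trading off optimality against action smoothness concrete, the following is a minimal illustrative sketch in Python of an action-inertia wrapper around a discrete-action policy. It is not the paper's PIC formulation or Nested Policy Iteration; the `q_values` interface and the `margin` threshold are assumptions introduced purely for illustration of the general smoothing concept.

```python
import numpy as np

class ActionInertiaWrapper:
    """Illustrative sketch only, not the paper's PIC method.

    Keeps the previously chosen discrete action unless the base policy's
    greedy action improves the estimated value by more than `margin`,
    reducing step-to-step action oscillation at a small cost in optimality.
    Assumes the base policy exposes `q_values(state) -> np.ndarray`.
    """

    def __init__(self, base_policy, margin=0.05):
        self.base_policy = base_policy  # assumed per-action value estimator
        self.margin = margin            # hypothetical smoothness/optimality knob
        self.prev_action = None

    def act(self, state):
        q = np.asarray(self.base_policy.q_values(state))
        greedy = int(np.argmax(q))
        if self.prev_action is None:
            # No history yet: take the greedy action.
            self.prev_action = greedy
        elif q[greedy] - q[self.prev_action] > self.margin:
            # Switch only when the greedy action is clearly better than sticking.
            self.prev_action = greedy
        return self.prev_action

    def reset(self):
        # Call at episode boundaries to clear the action history.
        self.prev_action = None
```

A larger `margin` yields smoother action sequences but tolerates more suboptimality, mirroring the optimality-smoothness balance the abstract describes; the paper's PIC and Nested Policy Iteration formalize and train this trade-off rather than fixing it by hand.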
