Abstract

Reinforcement learning is a machine learning method that requires no pre-collected training data; it seeks an optimal policy through continuous interaction between an agent and its environment, making it an important approach to sequential decision-making problems. Combined with deep learning, deep reinforcement learning offers powerful perception and decision-making capabilities and has been widely applied to complex decision problems across many domains. Off-policy reinforcement learning separates exploration from exploitation by storing and replaying interaction experiences, which makes it easier to find globally optimal solutions. How these stored experiences are utilized is therefore crucial to the efficiency of off-policy algorithms. To address this problem, this paper proposes Z-Score Prioritized Experience Replay, which improves the utilization of experiences and thereby the performance and convergence speed of the algorithm. A series of ablation experiments demonstrates that the proposed method significantly improves the effectiveness of deep reinforcement learning algorithms.
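
The abstract does not detail the mechanism of Z-Score Prioritized Experience Replay. The sketch below is only an illustrative reading of the general idea, in which transitions are sampled in proportion to the z-score (standardized magnitude) of their TD errors across the buffer; the class, method, and parameter names (`ZScoreReplayBuffer`, `sample`, `eps`, and so on) are hypothetical and not taken from the paper.

```python
import numpy as np

class ZScoreReplayBuffer:
    """Illustrative sketch of z-score-based prioritized replay.
    Transitions are sampled with probabilities derived from the
    z-score of their absolute TD errors over the whole buffer."""

    def __init__(self, capacity=100_000, eps=1e-6):
        self.capacity = capacity
        self.eps = eps            # keeps every priority strictly positive
        self.data = []            # stored transitions
        self.td_errors = []       # |TD error| recorded per transition
        self.pos = 0              # next write position (ring buffer)

    def add(self, transition, td_error):
        if len(self.data) < self.capacity:
            self.data.append(transition)
            self.td_errors.append(abs(td_error))
        else:
            self.data[self.pos] = transition
            self.td_errors[self.pos] = abs(td_error)
        self.pos = (self.pos + 1) % self.capacity

    def sample(self, batch_size):
        errors = np.asarray(self.td_errors)
        # Standardize TD errors across the buffer (the "z-score" step).
        z = (errors - errors.mean()) / (errors.std() + self.eps)
        # Shift to positive priorities, then normalize to a distribution.
        priorities = z - z.min() + self.eps
        probs = priorities / priorities.sum()
        idx = np.random.choice(len(self.data), size=batch_size, p=probs)
        return [self.data[i] for i in idx], idx

    def update(self, idx, new_td_errors):
        # Refresh priorities after the learner recomputes TD errors.
        for i, err in zip(idx, new_td_errors):
            self.td_errors[i] = abs(err)
```

In a typical off-policy training loop, such a buffer would replace uniform sampling from the replay memory: `sample` supplies each minibatch, and `update` is called with the freshly recomputed TD errors after every gradient step.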
