Enhancing stability and explainability in reinforcement learning with machine learning

Yinhe Chen

doi:10.54254/2755-2721/101/20240943

Abstract

Training agents with machine learning algorithms to learn and perform tasks in complex environments has become a prevalent approach in reinforcement learning. However, reinforcement learning still suffers from training instability and opaque decision-making, both of which limit its feasibility in real-world applications. To address these problems of stability and transparency, this study uses Proximal Policy Optimization (PPO), Q-DAGGER, and Gradient Boosting Decision Trees to build reinforcement learning agents in the OpenAI Gymnasium environment. The Atari game Breakout serves as the testbed: refining the reward structure and the decision-making process improves training efficiency and game performance, while integrated interpretable models provide explanations for the agents' decisions. The resulting agents are robust and perform well in this complex environment. The interpretable models also give insight into the strategies the agents learned, which enhances decision transparency. These findings support the broader application of reinforcement learning in real-world scenarios and offer insights for tackling other complex tasks.
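The abstract names PPO as one of the tools used against training instability but gives no implementation details. As a minimal sketch of the general idea only, and not the author's code, the standard PPO clipped surrogate objective can be written in plain Python as below. The function names and the clipping value of 0.2 are illustrative assumptions; nothing here reflects the study's actual hyperparameters or reward design.

```python
# Illustrative sketch of the standard PPO clipped surrogate objective.
# Assumption: names and clip_eps=0.2 are hypothetical, not from the study.

def clipped_surrogate(ratio, advantage, clip_eps=0.2):
    """Per-sample PPO objective (to be maximized).

    ratio:     pi_new(a|s) / pi_old(a|s), the policy probability ratio
    advantage: estimated advantage of the action taken
    """
    # Restrict the ratio to the interval [1 - clip_eps, 1 + clip_eps].
    clipped_ratio = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
    # Taking the minimum removes any incentive to push the policy
    # far from the old one in a single update.
    return min(ratio * advantage, clipped_ratio * advantage)


def mean_clipped_surrogate(ratios, advantages, clip_eps=0.2):
    """Average the per-sample objective over a batch."""
    if not ratios or len(ratios) != len(advantages):
        raise ValueError("ratios and advantages must be non-empty and equal length")
    total = sum(
        clipped_surrogate(r, a, clip_eps) for r, a in zip(ratios, advantages)
    )
    return total / len(ratios)
```

With a positive advantage, a ratio above `1 + clip_eps` earns no extra objective, so a single update cannot profit from moving the policy far from the one that collected the data. This bound on the update size is the property usually associated with PPO's training stability.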


Published in: Applied and Computational Engineering

