Solving nonstationary Markov decision processes via contextual decomposition: A military air battle management application

Joseph M Liles,Matthew J Robbins,Brian J Lunday

doi:10.1016/j.eswa.2023.120949

Abstract

Reinforcement learning for nonstationary problems is a subject of widespread research given that most realistic problems do not exist within static environments. Approaching these problems can require significant effort in feature engineering to provide a learning algorithm with enough useful information about the state space to uncover complex system dynamics. As an alternative for problems with sufficient data describing the nonstationary environment, we propose the contextual decomposition Markov decision process (CDMDP) as a collection of stationary sub-problems intended to approximate nonstationary problem dynamics using a linear combination of value functions. We demonstrate the effectiveness of the CDMDP approach with an application in military air battle management. We use a designed computational experiment and analysis of variance to show that a complex, nonstationary learning problem can be effectively approximated with a small set of stationary sub-problems, and that the CDMDP solution significantly improves solution quality over a baseline approach without the need for additional feature engineering. If a researcher suspects that a complex and continuously varying environment can be approximated by a small number of stationary contexts, the CDMDP framework may save significant computational resources and yield decision policies that are much easier to visualize and implement.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Solving nonstationary Markov decision processes via contextual decomposition: A military air battle management application

Abstract

Talk to us

Similar Papers

More From: Expert Systems With Applications

Lead the way for us

Journal: Expert Systems With Applications	Publication Date: Jul 8, 2023
Citations: 1

Similar Papers

Intelligent Gateway Selection and User Scheduling in Non-Stationary Air-Ground Networks
Youkun Peng ... Shuang Qin
-
Youkun Peng, et. al.Youkun Peng ... Shuang Qin
04 Dec 2022
04 Dec 2022

On the Optimality of Structured Policies in Countable Stage Decision Processes
Evan L Porteus
Management Science | VOL. 22
Evan L PorteusEvan L Porteus
01 Oct 1975
Management Science | VOL. 22

Physics-Constrained Bayesian Optimization for Optimal Actuators Placement in Composite Structures Assembly
Areej Albahar ... Inyoung Kim
IEEE Transactions on Automation Science and Engineering | VOL. 20
Areej Albahar, et. al.Areej Albahar ... Inyoung Kim
01 Oct 2023
IEEE Transactions on Automation Science and Engineering | VOL. 20

A context aware model for autonomous agent stochastic planning
Omer Ekmekci ... Faruk Polat
Robotics and Autonomous Systems | VOL. 112
Omer Ekmekci, et. al.Omer Ekmekci ... Faruk Polat
30 Nov 2018
Robotics and Autonomous Systems | VOL. 112

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Solving nonstationary Markov decision processes via contextual decomposition: A military air battle management application

Abstract

Talk to us

Similar Papers

More From: Expert Systems With Applications