Abstract

The multi-objective game (MOG) is a fundamental model for decision-making problems in which each player must consider multi-dimensional payoffs reflecting different objectives. Typically, solving a MOG involves refining the set of equilibrium strategies, a task known as MOG strategy selection (MOGS). However, existing MOG algorithms allow only a single metric for MOGS, which limits their applicability in real-world scenarios where players may hold different preferences over multiple metrics. In this paper, we first develop a preference-based MOGS framework that accommodates multiple metrics with different preferences. Within this framework, we introduce the comprehensive evaluation value (CEV) to measure the quality of a strategy set given the preference assigned to each metric. Using the CEV as a reward signal, we formulate the problem of finding the optimal strategy set as a Markov decision process and use deep reinforcement learning to train a policy for MOG strategy selection given the metrics and their corresponding preferences. Specifically, we combine a rational strategy filtering procedure with a Transformer-based encoder–decoder policy network to refine the strategies given the preferences, and we train the policy network with a revised REINFORCE algorithm. In addition, we introduce variable beam search decoding to improve rollout quality by tracking the most promising strategy sets and selecting the best one. We benchmark our algorithm on MOG instances generated by GAMUT; extensive experiments demonstrate that, across different preferences, it generates significantly better strategy sets than state-of-the-art baselines with lower computational overhead. Furthermore, we evaluate our approach on real-world problems, where it shows clear advantages in both performance and runtime.
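
To make the training idea concrete, below is a minimal, self-contained sketch (in PyTorch) of the abstract's core loop: score a sampled strategy set with a preference-weighted value standing in for the CEV, and reinforce the sampling policy with a baseline-corrected REINFORCE update. Everything here is an illustrative assumption, not the paper's implementation: the CEV form (a preference-weighted mean of payoffs), the linear scorer (the paper uses a Transformer-based encoder–decoder), and the moving-average baseline are all stand-ins.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Toy instance: n candidate strategies, each with a d-dimensional payoff
# vector (one entry per objective/metric). The goal is to select a subset
# of size k whose preference-weighted quality is high.
n, d, k = 12, 3, 4
payoffs = torch.rand(n, d)                   # hypothetical multi-objective payoffs
preferences = torch.tensor([0.5, 0.3, 0.2])  # hypothetical preference per metric

def cev(subset_idx: torch.Tensor) -> torch.Tensor:
    """Stand-in CEV: preference-weighted mean payoff of the chosen set."""
    return (payoffs[subset_idx].mean(dim=0) * preferences).sum()

# A tiny pointer-style policy: score each strategy from its payoffs plus the
# broadcast preferences, then sample k strategies without replacement while
# accumulating log-probabilities for the REINFORCE gradient estimator.
scorer = nn.Linear(d + d, 1)
opt = torch.optim.Adam(scorer.parameters(), lr=1e-2)
baseline = 0.0

for step in range(200):
    feats = torch.cat([payoffs, preferences.expand(n, d)], dim=1)
    logits = scorer(feats).squeeze(-1)
    chosen, log_prob = [], 0.0
    mask = torch.zeros(n, dtype=torch.bool)
    for _ in range(k):                        # sequential decoding of the set
        masked = logits.masked_fill(mask, float("-inf"))
        dist = torch.distributions.Categorical(logits=masked)
        i = dist.sample()
        log_prob = log_prob + dist.log_prob(i)
        mask[i] = True
        chosen.append(i)
    reward = cev(torch.stack(chosen))
    baseline = 0.9 * baseline + 0.1 * reward.item()  # moving-average baseline
    loss = -(reward.item() - baseline) * log_prob    # REINFORCE estimator
    opt.zero_grad()
    loss.backward()
    opt.step()
```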
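The variable beam search decoding mentioned in the abstract can be sketched in the same toy setting: rather than decoding one strategy set greedily, keep the `beam_width` most promising partial sets at each step and return the completed set with the highest score. The scoring function and candidate pool below are the same hypothetical stand-ins as above; the paper's decoder ranks candidates with its learned policy rather than by raw scores.

```python
import torch

def beam_search_sets(payoffs, preferences, k, beam_width=3):
    """Select a size-k strategy set by beam search over a stand-in CEV score."""
    n = payoffs.shape[0]

    def cev(idx):
        # Same hypothetical CEV as in the previous sketch.
        return (payoffs[list(idx)].mean(dim=0) * preferences).sum().item()

    beams = [((), 0.0)]                       # (partial strategy set, score)
    for _ in range(k):
        candidates = []
        for subset, _ in beams:
            for i in range(n):                # extend each beam by one strategy
                if i not in subset:
                    new = tuple(sorted(subset + (i,)))
                    candidates.append((new, cev(new)))
        # Deduplicate, then keep the beam_width highest-scoring partial sets.
        unique = {s: v for s, v in candidates}
        beams = sorted(unique.items(), key=lambda x: -x[1])[:beam_width]
    return max(beams, key=lambda x: x[1])     # best complete strategy set

best_set, best_score = beam_search_sets(torch.rand(12, 3),
                                        torch.tensor([0.5, 0.3, 0.2]), k=4)
print(best_set, round(best_score, 3))
```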
