Multi-agent Learning Algorithm Research Articles

User-centric radio access technology (RAT) selection is a key communication paradigm, given the increased number of available RATs and increased cognitive capabilities at the user end. When considered against traditional network-centric approaches, user-centric RAT selection results in reduced network-side management load, and leads to lower operational costs for RATs, as well as improved quality of service (QoS) and quality of experience (QoE) for users. The complex between-users interactions involved in RAT selection require, however, specific analyses, toward developing reliable and efficient schemes. Two theoretical frameworks are most often applied to user-centric RAT selection analysis, i.e., game theory (GT) and multi-agent learning (MAL). As a consequence, several GT models and MAL algorithms have been recently proposed to solve the problem at hand. A comprehensive discussion of such models and algorithms is, however, currently missing. Moreover, novel issues introduced by next-generation communication systems also need to be addressed. This paper proposes to fill the above gaps by providing a unified reference for both ongoing research and future research directions in the field. In particular, the review addresses the most common GT and MAL models and algorithms, and scenario settings adopted in user-centric RAT selection in terms of utility function and network topology. Regarding GT, the review focuses on non-cooperative models, because of their widespread use in RAT selection; as for MAL, a large number of algorithms are described, ranging from game-theoretic to reinforcement learning (RL) schemes, and also including most recent approaches, such as deep RL (DRL) and multi-armed bandit (MAB). Models and algorithms are analyzed by comparatively reviewing relevant literature. Finally, open challenges are discussed, in light of ongoing research and standardization activities.

Read full abstract

Two multi-agent policy iteration learning algorithms are proposed in this work. The two proposed algorithms use the exponential moving average approach along with the Q-learning algorithm as a basis to update the policy for the learning agent so that the agent's policy converges to a Nash equilibrium policy. The first proposed algorithm uses a constant learning rate when updating the policy of the learning agent, while the second proposed algorithm uses two different decaying learning rates. These two decaying learning rates are updated based on either the Win-or-Learn-Fast (WoLF) mechanism or the Win-or-Learn-Slow (WoLS) mechanism. The WoLS mechanism is introduced in this article to make the algorithm learn fast when it is winning and learn slowly when it is losing. The second proposed algorithm uses the rewards received by the learning agent to decide which mechanism (WoLF mechanism or WoLS mechanism) to use for the game being learned. The proposed algorithms have been theoretically analyzed and a mathematical proof of convergence to pure Nash equilibrium is provided for each algorithm. In the case of games with mixed Nash equilibrium, our mathematical analysis shows that the second proposed algorithm converges to an equilibrium. Although our mathematical analysis does not explicitly show that the second proposed algorithm converges to a Nash equilibrium, our simulation results indicate that the second proposed algorithm does converge to Nash equilibrium. The proposed algorithms are examined on a variety of matrix and stochastic games. Simulation results show that the second proposed algorithm converges in a wider variety of situations than state-of-the-art multi-agent reinforcement learning algorithms.

Read full abstract

Multi-agent Learning Algorithm Research Articles

Related Topics

Articles published on Multi-agent Learning Algorithm

Rationality-bounded adaptive learning in multi-agent dynamic games

Multi-Agent Reinforcement Learning for a Random Access Game

Multi-agent learning algorithms for content placement in cache-enabled small cell networks: 4G and 5G use cases

User-Centric Radio Access Technology Selection: A Survey of Game Theory Models and Multi-Agent Learning Algorithms

On Gradient-Based Learning in Continuous Games

Bounds and dynamics for empirical game theoretic analysis

SA-IGA: a multiagent reinforcement learning method towards socially optimal outcomes

Dynamic Spectrum Access in Time-Varying Environment: Distributed Learning Beyond Expectation Optimization

Cooperative Multi-Agent Joint Action Learning Algorithm (CMJAL) for Decision Making in Retail Shop Application

English

Sequence-Form and Evolutionary Dynamics: Realization Equivalence to Agent Form and Logit Dynamics

Exponential moving average based multiagent reinforcement learning algorithms

Scalable multi-agent learning algorithms to determine winners in combinatorial double auctions

Discrete-time dynamic graphical games: model-free reinforcement learning solution

Evolutionary Dynamics of Q-Learning over the Sequence Form

A robust approach for multi-agent natural resource allocation based on stochastic optimization algorithms

Decentralized Anti-coordination Through Multi-agent Learning

Multiagent learning in the presence of memory-bounded agents

Learning in Repeated Games with Minimal Information: The Effects of Learning Bias

Multiagent Learning in Large Anonymous Games

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Multi-agent Learning Algorithm Research Articles

Related Topics

Articles published on Multi-agent Learning Algorithm

Rationality-bounded adaptive learning in multi-agent dynamic games

Multi-Agent Reinforcement Learning for a Random Access Game

Multi-agent learning algorithms for content placement in cache-enabled small cell networks: 4G and 5G use cases

User-Centric Radio Access Technology Selection: A Survey of Game Theory Models and Multi-Agent Learning Algorithms

On Gradient-Based Learning in Continuous Games

Bounds and dynamics for empirical game theoretic analysis

SA-IGA: a multiagent reinforcement learning method towards socially optimal outcomes

Dynamic Spectrum Access in Time-Varying Environment: Distributed Learning Beyond Expectation Optimization

Cooperative Multi-Agent Joint Action Learning Algorithm (CMJAL) for Decision Making in Retail Shop Application

English

Sequence-Form and Evolutionary Dynamics: Realization Equivalence to Agent Form and Logit Dynamics

Exponential moving average based multiagent reinforcement learning algorithms

Scalable multi-agent learning algorithms to determine winners in combinatorial double auctions

Discrete-time dynamic graphical games: model-free reinforcement learning solution

Evolutionary Dynamics of Q-Learning over the Sequence Form

A robust approach for multi-agent natural resource allocation based on stochastic optimization algorithms

Decentralized Anti-coordination Through Multi-agent Learning

Multiagent learning in the presence of memory-bounded agents

Learning in Repeated Games with Minimal Information: The Effects of Learning Bias

Multiagent Learning in Large Anonymous Games