Abstract

Multi-agent systems and multi-robot systems have been recognized as unique solutions to complex dynamic tasks distributed in space. Their effectiveness in accomplishing these tasks rests upon the design of cooperative control strategies, which is acknowledged to be challenging and nontrivial. In particular, the effectiveness of these strategies has been shown to be related to the so-called exploration–exploitation dilemma: i.e., the existence of a distinct balance between exploitative actions and exploratory ones while the system is operating. Recent results point to the need for a dynamic exploration–exploitation balance to unlock high levels of flexibility, adaptivity, and swarm intelligence. This important point is especially apparent when dealing with fast-changing environments. Problems involving dynamic environments have been dealt with by different scientific communities using theory, simulations, as well as large-scale experiments. Such results spread across a range of disciplines can hinder one’s ability to understand and manage the intricacies of the exploration–exploitation challenge. In this review, we summarize and categorize the methods used to control the level of exploration and exploitation carried out by an multi-agent systems. Lastly, we discuss the critical need for suitable metrics and benchmark problems to quantitatively assess and compare the levels of exploration and exploitation, as well as the overall performance of a system with a given cooperative control algorithm.

Highlights

  • In recent years, there has been an increasing interest in using multi-agent systems (MAS) to carry out a wide array of complex tasks

  • The so-called exploration–exploitation dilemma is a common challenge faced by many systems operating on the basis of collective decision-making, including those encountered by various multi-agent and multi-robot systems

  • This dilemma stems from the fact that, in general, exploration and exploitation tend to be mutually exclusive tasks

Read more

Summary

INTRODUCTION

There has been an increasing interest in using multi-agent systems (MAS) to carry out a wide array of complex tasks. While this poses a unique problem for MRS practitioners, in this paper, we only focus on the behavioral aspects of how robotic agents balance their efforts in exploration and exploitation At this point, it should be noted that there is a marked difference in the strategies that are employed for use in static and dynamic tasks. There is still merit in studying the methods used to control the level of exploration and exploitation carried out in MAS operating in static and quasi-static environments This is because such systems still are susceptible to being trapped in local optima during the initial stages of the task (Leonard et al, 2011; Zou et al, 2015; Ghassemi and Chowdhury, 2019) and variations of strategies employed in static environments have been modified and applied for use in fast-evolving environments (Senanayake et al, 2016). We present possible directions for future research as well as summarizing remarks

Characterizing Exploration and Exploitation
Measuring Exploration and Exploitation
AGENT RESPONSE METHODS
Changes in State
Small Response Changes
Area and Task Assignment
INFORMATION DISSEMINATION METHODS
Changes in Network Topology
Stigmergy
CONCLUSION
DATA AVAILABILITY STATEMENT
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call