Abstract

Communication is vital to complete tasks coordinately for robot swarms. In this paper, we investigate massive MIMO enabled robot swarms. Specifically, for the robot swarms, the transceiver beamforming not only needs to maximize the rate, but also has to restrict the interference on other receivers. Therefore, the transceiver design of robots is critical to optimize the sum-rate performance under the restriction of the interference on the a specific robot. Currently, only exhaustive search is able to provide the optimal solution for the problem, whereas its complexity is unacceptable. In this paper, to address the intractable issue, based on the max-plus approach, we consider each transmitter or receiver as an independent decision agent, and all robots coordinately choose the optimal joint beam combination by max-plus algorithm. In the multi-agent framework, each agent learns the policy of choosing analog beam by reinforcement learning (RL). Furthermore, to improve the learning efficiency of RL and reduce the transmission latency, we exploit the efficient ELM network to replace the deep network of deep RL, and propose a ELM-based RL method to conduct the transmission between robots in robot swarm. Analysis and simulation results reveal that, the proposed method is able to achieve a near-optimal sum-rate performance, while the complexity is acceptable.

Highlights

  • Robot swarm [1], [2]is one of the hottest fields, since it can be widely used in artificial intelligence and the internet of things

  • We investigate hybrid precoding (HP) design for massive MIMO employed on robots

  • Only exhaustive search can provide the optimal solution for this problem, but its complexity is very massive

Read more

Summary

INTRODUCTION

Robot swarm [1], [2]is one of the hottest fields, since it can be widely used in artificial intelligence and the internet of things. Since HP is able to concentrate the beams on a narrow direction range, we apply massive MIMO and HP into robot communications to restrict the interference of each other. Since the HP is able to produce the narrow beam with high directivity, the interference from transmitters can be restricted by suitable beamformer This would be a novel and promising solution for robot communications. 2) To address the intractable issue of maximization of sum rate of whole robot swarm, we bring in the multiagent framework where each robot is considered as an independent decision agent, and all agents coordinately choose the optimal joint beam combination by maxplus algorithm. Diag[x1, x2, · · · , xk ] denotes a matrix whose diagonal elements are composed by x1, x2, · · · , xk while the rest of VOLUME 8, 2020. S denotes a set, and |S| is the cardinality of set S

SYSTEM MODEL
SYSTEM MODEL OF PRIMARY PAIR
INCREMENT OF INPUT DATA IN ELM
SIMULATIONS
EXPERIMENTAL RESULTS
CONCLUSION
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call