Deep Reinforcement Learning Based MAC Protocol for Underwater Acoustic Networks

Xiaowen Ye,Liqun Fu

doi:10.1145/3366486.3366526

Xiaowen Ye, Liqun Fu

https://doi.org/10.1145/3366486.3366526

Copy DOI

Export

Save

Cite

Publication Date: Oct 23, 2019

Citations: 12

Affiliation: Xiamen University

Abstract
Full-Text
Similar Papers

Abstract

Listen

Long propagation delay that causes throughput degradation of underwater acoustic networks (UWANs) is a critical issue in the medium access control (MAC) design in UWANs. In this paper, we put forth a new deep reinforcement learning (DRL) algorithm called delayed-reward deep Q-network (DR-DQN) to solve this problem. In particular, we propose a new multiple access protocol, referred to as delayed-reward DRL multiple access (DR-DLMA). The system objective is to find an optimal channel access strategy that maximizes the network throughput, by making full use of the available time-slots resulted from the long propagation delay or not used by other nodes in the network. We also give the upper bound on the network throughput in various cases as our benchmark, and show that the DR-DLMA can achieve a near-optimal performance with all the different propagation delays. Furthermore, we evaluate the performance of the proposed protocol in heterogeneous and homogeneous networks, respectively. When coexisting with one FW-ALOHA node, the maximum gap between the network throughput achieved by the DR-DLMA and the optimal solution is within 10%. In homogeneous networks, we demonstrate that DR-DLMA outperforms traditional MAC protocols for UWANs (e.g., slotted FAMA and DOTS) in terms of the network throughput. Particularly, in the four-node network with a maximum propagation delay of 4, DR-DLMA yields 470% and 190% higher throughput on average than slotted FAMA and DOTS, respectively.

Full Text