Abstract

To optimize data migration performance between supercomputing centers in China, we present TCP-DQN, an intelligent TCP congestion control method based on DQN (Deep Q-Network). The TCP congestion control process is abstracted as a partially observable Markov decision process in which an agent interacts with the network environment. The agent adjusts the size of the congestion window by observing characteristics of the network state; the environment feeds back a reward, and the agent tries to maximize the expected cumulative reward over an episode. We designed a weighted reward function to balance throughput and delay. Compared with traditional Q-learning, DQN uses two neural networks (a current network and a target network) together with experience replay to reduce the oscillation that can occur during gradient descent. We implemented TCP-DQN and compared it with mainstream congestion control algorithms such as CUBIC, HighSpeed, and NewReno. The results show that the throughput of TCP-DQN can reach more than twice that of the compared methods, while its latency stays close to that of the three baselines.
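
The exact form of this weighted reward is not reproduced on this page. As a minimal sketch only, assuming a log-throughput term and a normalized queueing-delay penalty (the function name, weights, and normalization below are illustrative assumptions, not the paper's formula), such a reward could look like:

    import math

    def weighted_reward(throughput_mbps, rtt_ms, base_rtt_ms,
                        w_throughput=1.0, w_delay=1.0):
        """Illustrative reward balancing throughput against queueing delay.

        Assumption: TCP-DQN's actual weighting is not given on this page;
        this log-throughput minus normalized-delay form is a sketch only.
        """
        # Reward grows with throughput (log-scaled to damp large values)
        throughput_term = math.log(max(throughput_mbps, 1e-6))
        # Penalize RTT inflation above the base (propagation) RTT
        delay_penalty = (rtt_ms - base_rtt_ms) / max(base_rtt_ms, 1e-6)
        return w_throughput * throughput_term - w_delay * delay_penalty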

Highlights

  • In recent years, China’s supercomputers have made great progress

  • To build a more efficient congestion control method by leveraging reinforcement learning, we propose TCP-DQN, a congestion control method based on the DQN algorithm, to implement efficient and reliable data migration in the virtual data space

  • TCP-DQN is compared with representative TCP congestion control algorithms such as CUBIC, NewReno, and HighSpeed

Summary

Introduction

China’s supercomputers have made great progress, and the Sunway TaihuLight and Tianhe systems rank among the fastest supercomputers in the world. However, storage resources are widely dispersed and autonomously managed across the national supercomputing centers, while large-scale computing applications urgently need a global data space that supports cross-domain unified access, wide-area data sharing, and storage-computing collaboration. To address this, we built a virtual data space that aggregates storage resources across the supercomputing centers.

The core idea of reinforcement learning is a trial-and-error mechanism combined with policy optimization: an agent improves its policy by trying various actions and learning which actions obtain more reward from the environment. In Q-learning, a greedy policy is used to generate the action a, meaning the agent always chooses the action with the maximum Q value in each state s. According to the Bellman equation, the Q value can be calculated iteratively by Equation (1).
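
Equation (1) is not reproduced on this page. For reference, the standard tabular Q-learning update derived from the Bellman equation takes the following form (shown here as the textbook update; the paper's exact notation may differ):

    Q(s_t, a_t) \leftarrow Q(s_t, a_t) + \alpha \left[ r_t + \gamma \max_{a'} Q(s_{t+1}, a') - Q(s_t, a_t) \right]

where \alpha is the learning rate and \gamma \in [0, 1] is the discount factor that trades off immediate against future reward.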
