AIOC[formula omitted]: A deep Q-learning approach to autonomic I/O congestion control in Lustre

Wen Cheng,Shijun Deng,Lingfang Zeng,Yang Wang,André Brinkmann

doi:10.1016/j.parco.2021.102855

Abstract

In high performance computing systems, I/O congestion is a common problem in large-scale distributed file systems. However, the current implementation mainly requires administrator to manually design low-level implementation and optimization, we proposes an adaptive I/O congestion control framework, named AIOC2, which can not only adaptively tune the I/O congestion control parameters, but also exploit the deep Q-learning method to start the training parameters and optimize the tuning for different types of workloads from the server and the client at the same time. AIOC2 combines the feedback-based dynamic I/O congestion control and deep Q-learning parameter tuning technology to achieve autonomic I/O congestion control, improve system I/O throughput, and thus reduce I/O latency without human interference. Experimental results show that AIOC2 can greatly reduce the impact of I/O congestion on I/O throughput and I/O latency performance in Lustre clusters. Compared to existing Lustre cluster systems, AIOC2 can increase write I/O throughput by 34.82% and decrease I/O latency by 26.17% on average.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

AIOC[formula omitted]: A deep Q-learning approach to autonomic I/O congestion control in Lustre

Abstract

Talk to us

Similar Papers

More From: Parallel Computing

Lead the way for us

Journal: Parallel Computing	Publication Date: Sep 29, 2021
Citations: 3

Similar Papers

A Deep Reinforcement Learning Framework for Optimizing Congestion Control in Data Centers
Shiva Ketabi ... Hongkai Chen
-
Shiva Ketabi, et. al.Shiva Ketabi ... Hongkai Chen
08 May 2023
08 May 2023

Performance evaluation of compound TCP+ congestion control parameters in wireless networks
Hiroyuki Hisamatsu ... Hiroki Oda
-
Hiroyuki Hisamatsu, et. al.Hiroyuki Hisamatsu ... Hiroki Oda
01 May 2014
01 May 2014

TCP Internal Buffers Optimization for Fast Long-Distance Links
A Baiocchi ... F Vacirca
-
A Baiocchi, et. al.A Baiocchi ... F Vacirca
01 Jan 2006
01 Jan 2006

Effectiveness and issues of congestion control in 802.11g wireless LANs
M Borri ... M Casoni
Wireless Networks | VOL. 14
M Borri, et. al.M Borri ... M Casoni
09 Oct 2006
Effectiveness and issues of congestion control in 802.11g wireless LANs
M Borri ... M Casoni

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

AIOC[formula omitted]: A deep Q-learning approach to autonomic I/O congestion control in Lustre

Abstract

Talk to us

Similar Papers

More From: Parallel Computing