RCC: Enabling Receiver-Driven RDMA Congestion Control With Congestion Divide-and-Conquer in Datacenter Networks

Jiao Zhang,Tao Huang,Xiaolong Zhong,Yu Tian,Zirui Wan,Tian Pan

doi:10.1109/tnet.2022.3185105

Abstract

The development of datacenter applications leads to the need for end-to-end communication with microsecond latency. As a result, RDMA is becoming prevalent in datacenter networks to mitigate the latency caused by the slow processing speed of the traditional software network stack. However, existing RDMA congestion control mechanisms are either far from optimal in simultaneously achieving high throughput and low latency or in need of additional in-network function support. In this paper, by leveraging the observation that most congestion occurs at the last hop in datacenter networks, we propose RCC, a receiver-driven rapid congestion control mechanism for RDMA networks that combines explicit assignment and iterative window adjustment. Firstly, we propose a network congestion distinguish method to classify congestions into two types, last-hop congestion and in-network congestion. Then, an Explicit Window Assignment mechanism is proposed to solve the last-hop congestion, which enables senders to converge to a proper sending rate in one-RTT. For in-network congestion, a PID-based iterative delay-based window adjustment scheme is proposed to achieve fast convergence and near-zero queuing latency. RCC does not need additional in-network support and is friendly to hardware implementation. In our evaluation, the overall average FCT (Flow Completion Time) of RCC is <inline-formula> <tex-math notation="LaTeX">$4{\sim}79\%$</tex-math> </inline-formula> better than Homa, ExpressPass, DCQCN, TIMELY, and HPCC.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

RCC: Enabling Receiver-Driven RDMA Congestion Control With Congestion Divide-and-Conquer in Datacenter Networks

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Networking

Lead the way for us

Journal: IEEE/ACM Transactions on Networking	Publication Date: Feb 1, 2023
Citations: 2

Similar Papers

Receiver-Driven RDMA Congestion Control by Differentiating Congestion Types in Datacenter Networks
Jiao Zhang ... Jiaming Shi
-
Jiao Zhang, et. al.Jiao Zhang ... Jiaming Shi
01 Nov 2021
01 Nov 2021

An Adaptable and Agnostic Flow Scheduling Approach for Data Center Networks
Sergio Armando Gutiérrez ... John W Branch-Bedoya
Journal of Network and Systems Management | VOL. 31
Sergio Armando Gutiérrez, et. al.Sergio Armando Gutiérrez ... John W Branch-Bedoya
28 Oct 2022
Journal of Network and Systems Management | VOL. 31

Efficient Data Center Flow Scheduling Without Starvation Using Expansion Ratio
Sheng Zhang ... Sanglu Lu
IEEE Transactions on Parallel and Distributed Systems | VOL. 28
Sheng Zhang, et. al.Sheng Zhang ... Sanglu Lu
01 Nov 2017
IEEE Transactions on Parallel and Distributed Systems | VOL. 28

Information-Agnostic Traffic Scheduling in Data Center Networks with Asymmetric Topologies
Ning Wei ... Keqiu Li
-
Ning Wei, et. al.Ning Wei ... Keqiu Li
01 Jun 2019
01 Jun 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

RCC: Enabling Receiver-Driven RDMA Congestion Control With Congestion Divide-and-Conquer in Datacenter Networks

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Networking