Abstract

The explosive growth of data center (DC) traffic imposes unprecedented challenges on the current electrical switch-based data center networks (DCNs) with the bottleneck of limited bandwidth and high latency. Benefitting from transparency to the data rate and format, optical switching with theoretically infinite bandwidth could overcome the bandwidth bottleneck of electrical switching DCNs. However, DCNs normally deploy multitenant applications with a variety of DC traffic. It is hard to reconfigure the optical interconnections in real-time to provide adaptable bandwidth to the traffic with heterogeneous characteristics. Moreover, to improve the bandwidth utilization, statistical multiplexing is generally deployed in optical DCNs to forward the traffic flow in a time slot, which requires the network time to be precisely synchronized to all network nodes. For fast optical switches with a nanosecond switching configuration time, overall end node times must be synchronized at subnanosecond magnitude. In this paper, we propose and experimentally investigate a reconfigurable and picosecond-synchronized optical DCN (ReSAW) based on an arrayed waveguide grating router (AWGR) and the White Rabbit (WR) protocol. A scheduler based on a distributed field-programmable gate array is implemented in the proposed ReSAW to realize flexible wavelength configuration by controlling the fast laser array based on semiconductor optical amplifiers (SOAs) according to time slot and traffic priority. Moreover, the WR protocol is implemented in optical DCNs for what we believe is the first time to synchronize the time of the distributed top of racks (ToRs). The experimental demonstration validates that ReSAW achieves an average end-to-end latency of 317.44 ns and a precise synchronized time with an average 386 ps skew. When the load is 0.4, the packet loss after ReSAW reconfiguration is less than 1.83 × 10 − 6 , and the network latency is less than 1.73 µs. Based on the experimental parameters and results, the OMNeT++ simulation model is built to further verify the reconfigurability and scalability of the ReSAW network. Results show that the packet loss rate and latency performance increase 8.24% and 12.47%, respectively, at a load of 0.6 as the ReSAW network scales from 2560 to 40,960 servers compared to before the reconfiguration.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call