Abstract Data center performance is affected by three main factors; bandwidth, latency, and reliability of intra-data center interconnection network. Bandwidth and latency are definitely improved by adopting optical technology for intra-data center communication, but fault tolerance of the corresponding optical networks has been raised less. Recently, we introduced two Torus-based, all-optical, and non-blocking networks, i.e. O-TF and O-FTF, addressing reliability of optical networks, and now, in this paper, to address the scalability problem, we propose a novel Optical Clos-based architecture which reduces minimum number of required wavelength channels, as well as, the switch size in each node. Moreover, we examine three different types of redundancies; including (a) wavelength (adopted in O-TF network), (b) hardware (adopted in O-FTF network), and (c) path (adopted Optical Clos)) redundancies, in terms of throughput drop under failures, implementation cost, optical power loss, electrical power, latency, topology, and wiring complexity. In addition to simulating networks utilizing generated traffic based on Microsoft data, realistic traffic has also been considered. Our simulation results confirm that in addition to improving power consumption and latency, O-TF, O-FTF, and O-Clos architectures lead to throughput increment up to 80%, 90%, and 70%, respectively, comparing to WaveCube at the presence of network failure, as result of adopting our proposed fault-tolerant method, named as RAOP.
Read full abstract