Abstract
Failures are not uncommon in production data center networks (DCNs) nowadays. It takes long time for the DCN routing to recover from a failure and find new forwarding paths, significantly impacting realtime and interactive applications at the upper layer. In this paper, we present a fault-tolerant DCN solution, called ${\mathrm {F^{2}}}$ Tree, which is readily deployed in existing DNCs. ${\mathrm {F^{2}}}$ Tree can significantly improve the failure recovery time only through a small amount of link rewiring and switch configuration changes. Through testbed and emulation experiments, we show that ${\mathrm {F^{2}}}$ Tree can greatly reduce the routing recovery time after failure (by 78%) and improve the performance of upper layer applications when routing failure happens (96% less deadline-missing requests).
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have