Abstract
By overcoming the server box barrier, resource disaggregation in data centers significantly improves resource utilization, which also provides a more cost-efficient approach for resource upgrade and expansion. The advantages of disaggregation have been explored in earlier research to improve the resource efficiency. This paper investigates the potential benefits of disaggregation from the aspect of reliability, which has not been considered before. Resource disaggregation brings a new failure pattern. For example, in a conventional server, the failure of one type of resource leads to the failure of the entire server, so that other types of resources in the same server also become unavailable. After disaggregating, the failure of different types of resources becomes more isolated, so that other resources are still available. In this paper, we model the reliability of a resource allocation request in a server-based or disaggregated DC based on whether the request is allocated with only working resources, or also provisioned with backup. We then consider a resource allocation problem with the objective of maximizing the number of requests accepted with guaranteed reliability. We provide an integer linear programming and a heuristic approach for this. Numerical studies demonstrate that resource disaggregation is possible to improve service reliability.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have