Abstract
In a power-aware scheduling system, power models are leveraged as the basis of estimating the effect of scheduling tasks. Previous studies showed that power consumption of servers is a non-linear function of resource usage, and a single set of parameters in one model can't accurately estimate power consumption at different usage levels. Gaussian Mixture Model (GMM) is a unsupervised machine learning model, which contains multiple GMM clusters. These clusters can be used to co-relate power consumption with resource features at different usage levels. In this paper we further adapt GMM for power estimation in a distributed computing cluster. We use basic OS-reported resource features (CPU utilization, memory utilization etc.) of a server in our GMM, and this makes operators easily monitor and control the state of the server once scheduling decision is made. In addition, our GMM uses conditional probability to obtain fine-grained regression. We train the model using full features, which has the higher accuracy comparing with that only using CPU or part of features. In the end, we evaluate the power models in terms of not only accuracy but also usability. Compare to other linear or non-linear models, GMM has the highest accuracy but requires the longest training time.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.