Abstract

Efficient resource management in data centers is of central importance to content service providers as 90 percent of the network traffic is expected to go through them in the coming years. In this context we propose the use of convolutional neural networks (CNNs) to forecast short-term changes in the amount of traffic crossing a data center network. This value is an indicator of virtual machine activity and can be utilized to shape the data center infrastructure accordingly. The behaviour of network traffic at the seconds scale is highly chaotic and therefore traditional time-series-analysis approaches such as ARIMA fail to obtain accurate forecasts. We show that our convolutional neural network approach can exploit the non-linear regularities of network traffic, providing significant improvements with respect to the mean absolute and standard deviation of the data, and outperforming ARIMA by an increasingly significant margin as the forecasting granularity is above the 16-second resolution. In order to increase the accuracy of the forecasting model, we exploit the architecture of the CNNs using multiresolution input distributed among separate channels of the first convolutional layer. We validate our approach with an extensive set of experiments using a data set collected at the core network of an Internet Service Provider over a period of 5 months, totalling 70 days of traffic at the one-second resolution.

Highlights

  • Nowadays, there is a general consensus in the Information and Communications Technology (ICT) industry regarding the higher demands in network bandwidth and speed that Internet and mobile systems will have to meet in comparison with today’s networks

  • In addition to Autoregressive integrated moving average (ARIMA), artificial feed forward neural networks (ANNs) and convolutional neural networks (CNNs), we evaluated a naive approach, which consisted in using the last observed value as a prediction

  • We have investigated the problem of forecasting short-term changes in data center network traffic load

Read more

Summary

Introduction

There is a general consensus in the Information and Communications Technology (ICT) industry regarding the higher demands in network bandwidth and speed that Internet and mobile systems will have to meet in comparison with today’s networks. Two additional sets of experiments were set up (1) to highlight the positive effect of including the multiresolution context (i.e. past observed values with different levels of granularity) in ANNs and in particular in CNNs, which can take further advantage of this approach through their multiple channels, and (2) to show the durability of the models, testing their ability to forecast the time series of increasingly distant future months. In this context, and given that the training procedures only take about one hour using a modern GPU, it is feasible to retrain CNN models monthly or even weekly.

Related work
Problem setting
Forecasting models
Artificial neural networks
Convolutional neural networks
Improving the quality of the forecasts
Multiresolution input for modelling exponentially wide context efficiently
Coarse-grained long-term forecasts
Multiple-channel convolutions for incorporating context
Experiments
Model fitting
Results
Durability of the models
Conclusions & future work
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call