This paper introduces NLTSP, a deep learning-based cost model designed to optimize tensor program performance in deep learning compilers. NLTSP, short for Nested Loop Tree Structure Processing, facilitates tensor program tuning by extracting information directly from the nested loop tree structure of sampled programs. NLTSP extracts features upstream in the compilation flow and eliminates the need for complex feature engineering. By utilizing a unified format for CPU and GPU architectures and extracting simple high-level features, NLTSP significantly accelerates feature extraction while maintaining prediction accuracy. We integrated NLTSP into Ansor, a leading search framework in the TVM compiler, and evaluated it experimentally. Compared with TenSet MLP, the state-of-the-art cost model utilizing Ansor features as inputs, NLTSP extracts features on average 97.9 times faster on CPU and 41.4 times faster on GPU, and reduces the average search time for CPU and GPU workloads by factors of 2.50 and 4.11, respectively. It is worth noting that NLTSP is not specifically designed for Ansor. Any auto-tuning framework capable of representing scheduled tensor programs as nested loop trees can potentially benefit from using NLTSP to achieve superior performance. The code is available at https://github.com/xhq0/NLTSP.
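To make the idea of operating directly on the loop nest concrete, the minimal sketch below shows one possible way a scheduled tensor program could be encoded as a nested loop tree and flattened into simple per-node feature vectors. The class names, fields, and annotation set are illustrative assumptions for exposition only, not NLTSP's actual representation or feature set.

```python
# Hypothetical nested-loop-tree encoding and feature extraction sketch.
# All names and fields here are assumptions, not the paper's implementation.
from dataclasses import dataclass, field
from typing import List

# Illustrative mapping from loop annotations to integer ids.
ANNOTATION_IDS = {"serial": 0, "parallel": 1, "vectorize": 2, "unroll": 3}

@dataclass
class LoopNode:
    extent: int                                    # loop trip count
    annotation: str = "serial"                     # e.g. "parallel", "vectorize"
    children: List["LoopNode"] = field(default_factory=list)

def extract_features(node: LoopNode, depth: int = 0) -> List[List[float]]:
    """Pre-order traversal emitting one simple feature vector per loop node."""
    vec = [float(depth), float(node.extent),
           float(ANNOTATION_IDS.get(node.annotation, 0))]
    features = [vec]
    for child in node.children:
        features.extend(extract_features(child, depth + 1))
    return features

# Example: a two-level loop nest with a parallel outer loop and vectorized inner loop.
tree = LoopNode(extent=128, annotation="parallel",
                children=[LoopNode(extent=16, annotation="vectorize")])
print(extract_features(tree))  # [[0.0, 128.0, 1.0], [1.0, 16.0, 2.0]]
```

Because such features are read directly off the loop tree rather than computed from low-level program analysis, extraction stays cheap and the same format can describe both CPU and GPU schedules.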