The Deep Learning Processor Unit (DPU) is a highly configurable CNN accelerator that supports a variety of CNNs and can be instantiated multiple times on the same FPGA. Many applications execute different CNNs concurrently, and in such a setting an execution time predictor can help optimize the DPU configurations to meet the performance requirements of the different tasks. We characterize CNN execution on DPUs and reduce the variability in execution time caused by interference from the operating system. We then propose a machine learning-based framework, EXPRESS, to predict the execution time of any given CNN on a DPU configuration, taking CNN, DPU, and bus characteristics into account. We extend EXPRESS to support heterogeneous CNNs in EXPRESS-2.0 by making the features independent of the number of CNNs. All experiments are based on data collected from a real FPGA board for 16 standard CNNs. Our frameworks, EXPRESS and EXPRESS-2.0, significantly outperform the state of the art, achieving average execution time prediction errors of 2.2% and 0.7%, respectively. We illustrate how this low prediction error enables design space exploration, which is valuable for embedded system application developers.
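The abstract does not specify the model or feature set EXPRESS uses; as a rough illustration of the approach it describes, the sketch below trains a regressor that maps per-(CNN, DPU-configuration) features to execution time and reports an average percentage error. The feature names, the random-forest model choice, and the synthetic training data are all assumptions for illustration, not the authors' implementation; in the paper, training data comes from measurements on a real FPGA board.

```python
# Minimal sketch (not the authors' implementation): a regression model that
# maps hypothetical CNN/DPU/bus features to execution time, in the spirit of
# the EXPRESS predictor. Features, model, and data are illustrative only.
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_absolute_percentage_error

rng = np.random.default_rng(0)

# Hypothetical feature vector per (CNN, DPU configuration) pair:
# [total MACs, total weights (MB), num layers,     # CNN characteristics
#  DPU peak ops/cycle, clock MHz, num DPU cores,   # DPU configuration
#  bus width (bits), bus bandwidth (GB/s)]         # bus characteristics
X = rng.uniform(size=(500, 8))
# Synthetic stand-in targets; real targets would be execution times
# measured on the FPGA board.
y = X @ rng.uniform(size=8) + 0.05 * rng.standard_normal(500)

X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)
model = RandomForestRegressor(n_estimators=200, random_state=0).fit(X_tr, y_tr)

# Same style of metric the abstract cites: average prediction error.
print(f"MAPE: {mean_absolute_percentage_error(y_te, model.predict(X_te)):.1%}")
```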