Abstract

High-Performance Computing systems rely on the software’s capability to be highly parallelized in individual computing tasks. However, even with a high parallelization level, poor scheduling can lead to long runtimes; this scheduling is in itself an NP-hard problem. Therefore, it is our interest to use a heuristic approach, particularly Cellular Processing Algorithms (CPA), which is a novel metaheuristic framework for optimization. This framework has its foundation in exploring the search space by multiple Processing Cells that communicate to exploit the search and in the individual stagnation detection mechanism in the Processing Cells. In this paper, we proposed using a Greedy Randomized Adaptive Search Procedure (GRASP) to look for promising task execution orders; later, a CPA formed with Iterated Local Search (ILS) Processing Cells is used for the optimization. We assess our approach with a high-performance ILS state-of-the-art approach. Experimental results show that the CPA outperforms the previous ILS in real applications and synthetic instances.

Highlights

  • According to the website www.top500.org, the supercomputer Fugaku from the Fujitsu RIKENCenter for Computational Science in Japan consists of 7,299,072 processing units

  • Focusing on the Fpppp instance set, Greedy Randomized Adaptive Search Procedure (GRASP)-Cellular Processing Algorithms (CPA) outperformed with statistical significance

  • For the Robot benchmark, GRASP-CPA outperformed Earliest Finish Time (EFT)-Iterated Local Search (ILS) with statistical significance in 48 cases, while EFT-ILS only outperformed GRASP-CPA in 12 instances, where most of them occurred for the instances with 16 machines

Read more

Summary

Introduction

According to the website www.top500.org, the supercomputer Fugaku from the Fujitsu RIKEN. The main idea is to cycle between exploring the search space with multiple limited-effort algorithms (Processing Cells) and sharing information (communication) among these Processing Cells. Cells should keep looking independently and not as a whole algorithm In this way, we avoid the computational cost of converging the whole population (when the solutions reach the same search space area) or exploring the search space with a single solution algorithm CPAs are flexible, e.g., a cellular processing algorithm can initialize each.

Problem Description
Instance of the Problem
Objective Function
Algorithms Descriptions
EFT-ILS Local Search
EFT-ILS Perturbation
Parallel Application Instances
Experimental Settings
Statistical Indicators
Results
Conclusions and Future Work
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call