HPCGCN: A Predictive Framework on High Performance Computing Cluster Log Data Using Graph Convolutional Networks.

Avishek Bose,Huichen Yang,Daniel Andresen,William H Hsu

doi:10.1109/bigdata52589.2021.9671370

Abstract

This paper presents a novel use case of Graph Convolutional Network (GCN) learning representations for predictive data mining, specifically from user/task data in the domain of high-performance computing (HPC). It outlines an approach based on a coalesced data set: logs from the Slurm workload manager, joined with user experience survey data from computational cluster users. We introduce a new method of constructing a heterogeneous unweighted HPC graph consisting of multiple typed nodes after revealing the manifold relations between the nodes. The GCN structure used here supports two tasks: i) determining whether a job will complete or fail and ii) predicting memory and CPU requirements by training the GCN semi-supervised classification model and regression models on the generated graph. The graph is partitioned into partitions using graph clustering. We conducted classification and regression experiments using the proposed framework on our HPC log dataset and evaluated predictions by our trained models against baselines using test_score, F1-score, precision, recall for classification, and R1 score for regression, showing that our framework achieves significant improvements.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

HPCGCN: A Predictive Framework on High Performance Computing Cluster Log Data Using Graph Convolutional Networks.

Abstract

Talk to us

Similar Papers

More From: Proceedings : ... IEEE International Conference on Big Data. IEEE International Conference on Big Data

Lead the way for us

Journal: Proceedings : ... IEEE International Conference on Big Data. IEEE International Conference on Big Data	Publication Date: Dec 15, 2021
Citations: 2

Similar Papers

A Domain-Specific Language for High-Level Parallelization
Ritu Arora ... Marjan Mernik
-
Ritu Arora, et. al.Ritu Arora ... Marjan Mernik
26 Nov 2015
26 Nov 2015

CommunityGCN: community detection using node classification with graph convolution network
Riju Bhattacharya ... Sarsij Tripathi
Data Technologies and Applications | VOL. 57
Riju Bhattacharya, et. al.Riju Bhattacharya ... Sarsij Tripathi
07 Feb 2023
Data Technologies and Applications | VOL. 57

Power and energy efficient routing for Mach-Zehnder interferometer based photonic switches
Markos Kynigos ... Mikel Lujan
-
Markos Kynigos, et. al.Markos Kynigos ... Mikel Lujan
03 Jun 2021
03 Jun 2021

An Efficient Recommendation Algorithm Integrating Knowledge Graph with Graph Convolutional Networks
Changzheng Xing ... Jialong Guo
-
Changzheng Xing, et. al.Changzheng Xing ... Jialong Guo
01 Feb 2023
01 Feb 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

HPCGCN: A Predictive Framework on High Performance Computing Cluster Log Data Using Graph Convolutional Networks.

Abstract

Talk to us

Similar Papers

More From: Proceedings : ... IEEE International Conference on Big Data. IEEE International Conference on Big Data