Interactive visual analytics of parallel training strategies for DNN models

Zhongwei Wang,Yating Wei,Gongchang Ou,Han Gao,Haitao Yang,Yue Wang,Chen Cao,Minfeng Zhu,Wei Chen

doi:10.1016/j.cag.2023.07.030

Abstract

Understanding and optimizing the parallel training strategies in the training of large-scale Deep Neural Network (DNN) models is crucial to enhance training efficiency. Existing works tried to demonstrate the layer-level information of the computational graph to support parallel training strategy selection. Whereas, the overall parallel execution logic is rarely considered by previous methods. In this paper, we proposed a novel visual analytics approach for parallel training strategies, demonstrating the execution logic of the distributed computing from up to bottom via explaining communication operators. Specifically, a computation-communication bipartite construction algorithm is designed for the computational graph visualization. Furthermore, a system is developed to help users easily access the proposed approach and explore the parallel training strategies interactively. With empirical evaluation through a quantitative user study and a qualitative expert interview, the practicality and superiority of the proposed approach is verified.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Interactive visual analytics of parallel training strategies for DNN models

Abstract

Talk to us

Similar Papers

More From: Computers & Graphics

Lead the way for us

Journal: Computers & Graphics	Publication Date: Jul 17, 2023
Citations: 1

Similar Papers

Visual Diagnostics of Parallel Performance in Training Large-Scale DNN Models.
Yating Wei ... Han Gao
IEEE transactions on visualization and computer graphics | VOL. 30
Yating Wei, et. al.Yating Wei ... Han Gao
01 Jan 2024
IEEE transactions on visualization and computer graphics | VOL. 30

A Framework for Distributed Deep Neural Network Training with Heterogeneous Computing Platforms
Bontak Gu ... Arslan Munir
-
Bontak Gu, et. al.Bontak Gu ... Arslan Munir
01 Dec 2019
01 Dec 2019

PipePar: Enabling fast DNN pipeline parallel training in heterogeneous GPU clusters
Jinghui Zhang ... Zhiang Wu
Neurocomputing | VOL. 555
Jinghui Zhang, et. al.Jinghui Zhang ... Zhiang Wu
04 Aug 2023
Neurocomputing | VOL. 555

AccelAT: A Framework for Accelerating the Adversarial Training of Deep Neural Networks Through Accuracy Gradient
Farzad Nikfam ... Alberto Marchisio
IEEE Access | VOL. 10
Farzad Nikfam, et. al.Farzad Nikfam ... Alberto Marchisio
01 Jan 2021
IEEE Access | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Interactive visual analytics of parallel training strategies for DNN models

Abstract

Talk to us

Similar Papers

More From: Computers &amp; Graphics

More From: Computers & Graphics