Abstract
The overwhelming parameter and computation demands of deep neural networks (DNNs) limit their applicability on a single computing node with weak computing power, such as edge and mobile devices. Most previous works leverage model pruning and compression strategies to reduce DNN parameters for resource-constrained devices; however, most model compression methods suffer from accuracy loss. Recently, we have found that combining many weak computing nodes into a distributed system to run large and sophisticated DNN models is a promising solution to this issue. However, such a distributed system requires purpose-built distributed DNN models and inference schemes: one of its great challenges is how to design an efficient distributed DNN model for data parallelism and model parallelism, and communication overhead is another critical performance bottleneck for distributed DNN models. Therefore, in this article, we propose the DFSNet framework (Dividing-Fuse neural Network with Searching Strategy) for distributed DNN architectures. Firstly, the DFSNet framework includes a joint "dividing-fusing" method that converts regular DNN models into distributed models that are friendly to distributed systems. This method divides the conventional DNN model along the channel dimension and inserts a few special layers that fuse feature-map information from different channel groups to improve accuracy. Since the fusion layers are sparse in the network, they add little extra inference time and communication overhead on the distributed nodes, yet they significantly help maintain the accuracy of the distributed neural network. Secondly, considering the architecture of the distributed computing nodes, we propose a parallel fusion topology that improves the utilization of the different computing nodes. Lastly, the popular weight-sharing neural architecture search (NAS) technique is leveraged to search for the positions of the fusion layers in the distributed DNN model for high accuracy and finally generate an efficient distributed DNN model. Compared with the original network, our converted distributed DNN achieves better performance (e.g., a 1.88% improvement for ResNet56 on CIFAR-100 and a 1.25% improvement for MobileNetV2 on ImageNet). In addition, most layers of the DNN are divided across the distributed nodes along the channel dimension, which makes the model particularly suitable for distributed DNN architectures with very low communication overhead.
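The core "dividing-fusing" idea, splitting most layers along the channel dimension so that each channel group can run on a separate node, with a few sparse fusion layers mixing information between groups, can be illustrated by the minimal PyTorch sketch below. This is a hypothetical sketch, not the paper's implementation: the group count, layer sizes, and fusion positions are illustrative assumptions, whereas the paper selects the fusion positions via weight-sharing NAS.

```python
# Hypothetical sketch of the "dividing-fusing" idea (not the authors' code).
# A grouped convolution processes channel groups independently, so each group
# could be placed on a different computing node; a sparse 1x1 fusion layer is
# the only point where feature maps from different groups are exchanged.
import torch
import torch.nn as nn

class DividedBlock(nn.Module):
    """Conv block divided along the channel dimension into independent groups."""
    def __init__(self, in_channels, out_channels, groups=4):
        super().__init__()
        self.conv = nn.Conv2d(in_channels, out_channels, kernel_size=3,
                              padding=1, groups=groups, bias=False)
        self.bn = nn.BatchNorm2d(out_channels)
        self.act = nn.ReLU(inplace=True)

    def forward(self, x):
        return self.act(self.bn(self.conv(x)))

class FusionLayer(nn.Module):
    """1x1 convolution over ALL channels; mixes information between groups."""
    def __init__(self, channels):
        super().__init__()
        self.fuse = nn.Conv2d(channels, channels, kernel_size=1, bias=False)

    def forward(self, x):
        return self.fuse(x)

class DFSNetSketch(nn.Module):
    """Stack of divided blocks with occasional fusion layers. The fusion
    positions here (after blocks 2 and 5) are placeholders for what the
    paper would choose with weight-sharing NAS."""
    def __init__(self, channels=64, depth=6, groups=4, fusion_positions=(2, 5)):
        super().__init__()
        layers = [nn.Conv2d(3, channels, kernel_size=3, padding=1)]
        for i in range(depth):
            layers.append(DividedBlock(channels, channels, groups=groups))
            if i in fusion_positions:
                layers.append(FusionLayer(channels))
        self.body = nn.Sequential(*layers)

    def forward(self, x):
        return self.body(x)

if __name__ == "__main__":
    model = DFSNetSketch()
    y = model(torch.randn(1, 3, 32, 32))
    print(y.shape)  # torch.Size([1, 64, 32, 32])
```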