Abstract

Edge-cloud collaborative inference can significantly reduce the delay of a deep neural network (DNN) by dividing the network between the mobile edge and the cloud. However, the size of a DNN's intermediate-layer data is usually larger than that of the original input, so the communication time needed to send the intermediate data to the cloud also increases end-to-end latency. To cope with these challenges, this paper proposes a novel convolutional neural network structure, BBNet, that accelerates collaborative inference at two levels: (1) channel pruning, which reduces the number of computations and parameters of the original network; and (2) compression of the feature map at the split point, which further reduces the amount of data transmitted. In addition, this paper implements the BBNet structure on an NVIDIA Nano device and a server. Compared with the original network, BBNet achieves compression rates of up to 5.67× in FLOPs and 11.57× in parameters. In the best case, the feature-compression layer reaches a bit-compression rate of 512×. BBNet's reduction in inference delay is more pronounced under poor network conditions than under good bandwidth. For example, when the upload bandwidth is only 20 kb/s, the end-to-end latency of BBNet is 38.89× lower than that of the cloud-only approach.
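
As a rough illustration of the second level, the PyTorch sketch below shows one way a feature-compression layer at the split point could look: the encoder runs on the edge device, the quantized bottleneck is uploaded, and the decoder restores the feature map in the cloud. The channel counts, stride, and 8-bit quantization are assumptions for illustration, not BBNet's published design.

```python
import torch
import torch.nn as nn

class FeatureCompressor(nn.Module):
    """Illustrative bottleneck at the split point: the encoder runs on the
    edge device, the decoder on the cloud. Channel counts, stride, and the
    8-bit quantization are assumptions, not BBNet's exact design."""

    def __init__(self, in_channels=256, bottleneck_channels=8):
        super().__init__()
        # Edge side: shrink channels and spatial size before transmission.
        self.encoder = nn.Sequential(
            nn.Conv2d(in_channels, bottleneck_channels, kernel_size=3, stride=2, padding=1),
            nn.ReLU(inplace=True),
        )
        # Cloud side: restore the feature map for the remaining layers.
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(bottleneck_channels, in_channels, kernel_size=4, stride=2, padding=1),
            nn.ReLU(inplace=True),
        )

    def compress(self, feature_map):
        z = self.encoder(feature_map)
        # Quantize float32 activations to uint8 (4x fewer bits) before upload.
        z_min, z_max = z.min(), z.max()
        scale = (z_max - z_min).clamp(min=1e-8) / 255.0
        q = ((z - z_min) / scale).round().to(torch.uint8)
        return q, z_min, scale

    def decompress(self, q, z_min, scale):
        # De-quantize and expand back to the original feature-map shape.
        z = q.float() * scale + z_min
        return self.decoder(z)
```

With these illustrative shapes the payload shrinks by 32× (channels) × 4× (spatial) × 4× (bit width) = 512×, which happens to match the best-case rate quoted above; BBNet's actual compression layer may use different shapes.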

Highlights

  • In recent years, deep learning has achieved impressive performance in various smart application scenarios [1,2]

  • We propose a novel convolutional neural network structure for edge-cloud collaborative inference that reduces end-to-end latency by accelerating inference from two directions

  • Based on the related work, we propose the BBNet structure shown in Figure 1, which combines three technologies: model compression, deep neural network (DNN) model partition, and feature compression (a minimal pruning sketch follows this list)
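
As a minimal sketch of the model-compression step, the snippet below applies generic L1-norm channel pruning to a single convolution. The pruning criterion and keep ratio are assumptions for illustration, since this summary does not describe BBNet's exact pruning rule.

```python
import torch
import torch.nn as nn

def prune_conv_channels(conv: nn.Conv2d, keep_ratio: float = 0.5):
    """Generic L1-norm channel pruning: keep the output channels whose filters
    have the largest L1 norm. Criterion and keep_ratio are illustrative only."""
    num_keep = max(1, int(conv.out_channels * keep_ratio))
    # Rank filters by the L1 norm of their weights.
    scores = conv.weight.detach().abs().sum(dim=(1, 2, 3))
    keep_idx = torch.argsort(scores, descending=True)[:num_keep]

    pruned = nn.Conv2d(conv.in_channels, num_keep, conv.kernel_size,
                       stride=conv.stride, padding=conv.padding,
                       bias=conv.bias is not None)
    with torch.no_grad():
        pruned.weight.copy_(conv.weight[keep_idx])
        if conv.bias is not None:
            pruned.bias.copy_(conv.bias[keep_idx])
    # Note: in a full network the next layer's input channels must be pruned
    # with the same keep_idx to stay consistent.
    return pruned, keep_idx

# Example: pruning half the output channels of a 3x3 convolution.
conv = nn.Conv2d(64, 128, kernel_size=3, padding=1)
pruned, kept = prune_conv_channels(conv, keep_ratio=0.5)
print(conv.weight.numel(), pruned.weight.numel())  # parameter count roughly halves
```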

Summary

Introduction

Deep learning has achieved impressive performance in various smart application scenarios [1,2]. In the cloud-only approach, the DNN model is deployed in the cloud, the original data are sent directly to the cloud, and the inference result is returned. In this way, the original data are transmitted over the channel, which threatens sensitive data and increases communication delay. Kang et al. [4] proposed a new method of partitioned deployment of deep neural networks to enable joint inference between the edge device and the cloud: the edge device runs the early layers of the network and uploads the intermediate feature data to the cloud, which executes the remaining layers.
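
The partitioned deployment described above can be sketched as follows, assuming an off-the-shelf CNN (torchvision's ResNet-18) and an arbitrary split index; BBNet chooses its own split point and adds pruning and feature compression around it.

```python
import io
import torch
import torch.nn as nn
from torchvision.models import resnet18

# Illustrative partition of a generic CNN in the spirit of Kang et al. [4];
# the model choice and split index are assumptions for this sketch.
model = resnet18(weights=None).eval()
layers = list(model.children())          # coarse-grained blocks of ResNet-18
split = 6                                # edge runs blocks [0, 6); cloud runs the rest
edge_part = nn.Sequential(*layers[:split])
cloud_part = nn.Sequential(*layers[split:-1], nn.Flatten(), layers[-1])

def edge_inference(image: torch.Tensor) -> bytes:
    """Run the early layers on the edge device and serialize the feature map."""
    with torch.no_grad():
        features = edge_part(image)
    buf = io.BytesIO()
    torch.save(features, buf)            # stand-in for the real upload payload
    return buf.getvalue()

def cloud_inference(payload: bytes) -> torch.Tensor:
    """Deserialize the intermediate features and run the remaining layers."""
    features = torch.load(io.BytesIO(payload))
    with torch.no_grad():
        return cloud_part(features)

logits = cloud_inference(edge_inference(torch.randn(1, 3, 224, 224)))
print(logits.shape)                      # torch.Size([1, 1000])
```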

