Abstract
Collaborative intelligence (CI) has been proposed to make efficient use of the computational resources of edge devices in edge-cloud environments. In CI, a deep neural network (DNN) model is divided into two parts, one deployed on the edge device and the other in the cloud, so that the two perform DNN inference collaboratively. Feature compression methods have been investigated to reduce the traffic volume between the edge and the cloud, because the feature map output by the edge device in CI is typically larger than the compressed original image. Although existing feature compression methods achieve good compression rates, the resulting traffic volume is still not small enough, because the feature map itself can be larger than the original image. Herein, we propose a novel feature compression method that uses an autoencoder (AE) to compress the feature map. Whereas image compression minimizes distortion and data size, our method trains the AE to minimize the prediction error and data size. In this way, the AE extracts only the data needed for the task of the DNN model and considerably reduces the data size while maintaining the prediction accuracy. In the performance evaluation, we apply the proposed method to three widely used convolutional neural network models: VGG16, ResNet50, and YOLOv3. The results confirm that the proposed method drastically reduces the traffic volume while still utilizing the resources of edge devices in CI. Compared to versatile video coding (VVC), the proposed method reduces the traffic volume by 62–99%, 16–99%, and 35–53%, respectively, for the three models.
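The training objective described above, prediction error plus data size, can be illustrated with a minimal sketch. The abstract does not give the actual loss terms, so everything here is an assumption made for illustration: cross-entropy as the prediction error, the empirical entropy of a uniformly quantized latent as a stand-in for data size, and a hypothetical weight `lam` balancing the two. The function names are likewise hypothetical, and a real training setup would need a differentiable size estimate.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def prediction_error(logits, label):
    """Cross-entropy of the cloud-side output against the true label
    (assumed form of the prediction error)."""
    return -math.log(softmax(logits)[label])

def size_proxy_bits(latent, step=1.0):
    """Empirical entropy, in bits, of the uniformly quantized latent.
    This is an assumed stand-in for the transmitted data size."""
    symbols = [round(v / step) for v in latent]
    counts = {}
    for s in symbols:
        counts[s] = counts.get(s, 0) + 1
    n = len(symbols)
    bits_per_symbol = -sum((c / n) * math.log2(c / n) for c in counts.values())
    return bits_per_symbol * n

def joint_objective(logits, label, latent, lam=0.01):
    """Prediction error plus lam times the size proxy (lam is hypothetical)."""
    return prediction_error(logits, label) + lam * size_proxy_bits(latent)
```

Here `latent` stands for the AE bottleneck output sent from the edge to the cloud, and `logits` for the output of the cloud-side part of the DNN after decoding. Unlike a distortion term, nothing in this objective penalizes differences between the original and reconstructed feature map, so the AE is free to discard information the task does not need.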