Deep learning-based image recognition systems have rapidly evolved. Due to the extensive processing load of the deep neural network (DNN) on graphic processing units (GPUs), the DNN model is deployed on the cloud server. Images or videos are forwarded from user terminals through the network to the server. In recent years, edge computing has gained popularity as a means of reducing the data traffic in the backbone network. However, the last one-mile access network between an edge server and user terminals will still be congested because a large amount of data such as video/image files must be forwarded. In particular, when computer vision applications such as image recognition are loaded in the edge network, a large amount of data is forwarded although the edge server always may not need the high-definition image. This paper proposes an image compression and progressive retransmission scheme for deep learning-based image recognition systems to reduce image data traffic and alleviate network congestion. The proposed method introduces an entropy-based threshold calculated from posterior probabilities from a deep learning model’s output layer. Entropy is an extremely effective metric because it can be used as an indicator independent of the number of classification labels in the DNN model. The thresholding can control the image retransmission and reduce traffic while maintaining image recognition accuracy.We implement the proposed scheme on the edge server and reveal the relationship between the data compression and the recognition accuracy through simulation evaluation. As a result, we indicate that an entropy-based threshold reduces the overall ambiguity of the accuracy of image recognition. Moreover, when a higher accuracy recognition model with more accuracy is combined with a retransmission scheme, it becomes the more effective.
Read full abstract