Abstract

Convolutional neural network (CNN)-based object detection is a key technology to enable autonomous mobile vision applications on mobile end devices such as smart phones and drones. With the advance of edge computing technology, a prevalent solution is to offload the computation-intensive CNN inference tasks to edge networks for fast and accurate object detection. However, a single edge server may not be powerful enough to ensure fast and accurate object detection due to its resource constraints. In this paper, we propose a multi-edge assisted fast object detection framework, MASS, to further reduce the object detection latency while maintaining the detection accuracy. In MASS, the CNN model is divided into two parts, namely the Head part and the Tail part. The Head part is executed locally and the Tail part is further split into multiple subtasks, which can be offloaded to multiple edge servers for parallel execution. First, we propose a method to select the optimal parallel entry point to separate the Head part from the Tail part. Then, an adaptive subtask generation and offloading strategy is proposed to divide the Tail part into multiple subtasks and offload these subtasks to multiple heterogeneous edge servers. Besides, we also propose a uniformly sampled zero-padding scheme to reduce the communication cost among edge servers when executing these subtasks in parallel. We implement MASS in a testbed with four edge servers and evaluate its performance, where the experimental results show that MASS can reduce object detection latency by up to $64.83%$, while the detection accuracy degradation is less than $3%$.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call