Collaborative Edge Computing With FPGA-Based CNN Accelerators for Energy-Efficient and Time-Aware Face Tracking System

Xing Liu,Yuao Chen,Jianfeng Yang,Chengming Zou,Xin Yan,Chenran Cai,Qimei Chen

doi:10.1109/tcss.2021.3059318

Abstract

Convolutional neural networks (CNNs) have become the critical technology to realize face detection and face recognition in the face tracking (FT) system. However, traditional CNNs usually have nontrivial computational time and high energy consumption, making them inappropriate to be deployed in the large-scale time-sensitive FT system. To address this challenge, we design an artificial intelligence and Internet of Things (AIoT) empowered edge-cloud collaborative computing (ECCC) system based on the energy-efficient field-programmable gate array (FPGA)-based CNN accelerators for the purpose of realizing a low-latency and low-power FT. First, we present the AIoT-empowered ECCC system architecture, which consists of an intelligent computing subsystem, an Internet-of-Things (IoT) subsystem, an edge-cloud collaborative subsystem, and an application subsystem. In what follows, we investigate the enabling technologies for these subsystems. Thereafter, we develop an FPGA-based hardware accelerator dedicated to the compact MobileNet CNN by using the hardware design techniques, such as systolic array, matrix tiling, fixed-point precision, and parallelism. Furthermore, we integrate the FPGA accelerators with CPUs and GPUs to build a context-aware CPU/GPU/FPGA heterogeneous computing system. Finally, we implement a delay-aware energy-efficient scheduling algorithm dedicated to this heterogeneous system. With the above hardware and software codesign mechanism, the energy cost and execution time of CNNs can be decreased significantly. The real-world experiments on the CPU/GPU/FPGA-based ECCC system proved the effectiveness of the proposed schemes in reducing the latency and improving the power efficiency of the FT system.

Full Text