Abstract
Convolutional Neural Network (CNN) inference on a resource-constrained Internet-of-Things (IoT) device (e.g., an ARM Cortex-M microcontroller) requires careful optimization to reduce timing overhead. We propose two novel techniques that improve the computational efficiency of CNNs on low-cost microcontrollers. Our techniques exploit on-chip memory and eliminate redundant operations, yielding low-latency inference on complex quantized models such as MobileNetV1. On the ImageNet dataset with per-layer quantization, we reduce inference latency by 22.4% and improve Multiply-and-Accumulate (MAC) operations per cycle by 22.9% compared to the state-of-the-art mixed-precision CMix-NN library. On the CIFAR-10 dataset with per-channel quantization, we reduce inference latency by 31.7% and improve MACs per cycle by 31.3%. The resulting low-latency inference can improve the user experience and save power in resource-constrained IoT devices.
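To make the quantized-inference setting concrete, the sketch below shows a generic int8 MAC inner loop with per-channel requantization of the kind such kernels optimize. This is an illustrative example only, not the authors' optimized kernel: the function name, the fixed-point multiplier/shift scheme, and all parameters are assumptions for illustration.

```c
#include <stdint.h>

/* Illustrative int8 dot product with per-channel requantization.
 * A hypothetical sketch of a quantized-CNN inner loop, NOT the
 * paper's kernel: mult/shift model the per-channel output scale. */
static int8_t dot_requant(const int8_t *x, const int8_t *w, int len,
                          int32_t bias, int32_t mult, int shift)
{
    int32_t acc = bias;
    for (int i = 0; i < len; i++)
        acc += (int32_t)x[i] * (int32_t)w[i];   /* 8-bit MAC into 32-bit accumulator */

    /* Requantize: per-channel fixed-point multiply, then right shift. */
    int64_t scaled = (int64_t)acc * (int64_t)mult;
    int32_t out = (int32_t)(scaled >> shift);

    /* Saturate to the int8 output range. */
    if (out > 127)  out = 127;
    if (out < -128) out = -128;
    return (int8_t)out;
}
```

On a Cortex-M4/M7 this loop is what optimized libraries (e.g., CMSIS-NN) replace with packed SIMD MAC instructions such as SMLAD; the techniques in the brief target exactly this kind of hot loop by reusing on-chip memory and avoiding redundant operations.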
IEEE Transactions on Circuits and Systems II: Express Briefs