Impact of Embedded Deep Learning Optimizations for Inference in Wireless IoT Use Cases

Jaron Fontaine,Eli De Poorter,Adnan Shahid,Ben Van Herbruggen

doi:10.1109/iotm.001.2200158

Abstract

Wireless internet-of-things devices typically transmit data to high-end computing platforms such as edge devices or cloud devices for further processing. However, processing the data at the edge or even on the constrained embedded devices using neural network can eliminate the need for high-throughput links and provides several benefits in terms of latency, reliability, privacy and energy consumption. In this article, we quantify how efficient neural networks can run on embedded devices for three typical wireless use cases. To this end, we give an overview of different optimizations and strategies for embedded deep learning inference on constrained devices. Next, we quantify the performance impact of optimized neural networks for edge and embedded inference, which perform up to 2.5x and 20x faster and consume 20x less energy at the cost of less than 2 percent accuracy difference for classification models. Although most published oversized models cannot run on typical embedded devices, with optimizations, we achieve efficient embedded inference and mitigate the need for raw data transmissions and thus preserve privacy. Finally, we discuss trends found in embedded deep learning use cases and present insights between design- and run-time metrics to predict model memory, storage and energy consumption together with model inference time.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Impact of Embedded Deep Learning Optimizations for Inference in Wireless IoT Use Cases

Abstract

Talk to us

Similar Papers

More From: IEEE Internet of Things Magazine

Lead the way for us

Journal: IEEE Internet of Things Magazine	Publication Date: Dec 1, 2022
Citations: 4

Similar Papers

Efficient Neural Networks on the Edge with FPGAs by Optimizing an Adaptive Activation Function.
Yiyue Jiang ... Miriam Leeser
Sensors | VOL. 24
Yiyue Jiang, et. al.Yiyue Jiang ... Miriam Leeser
13 Mar 2024
Sensors | VOL. 24

Differentiable neural architecture learning for efficient neural networks
Qingbei Guo ... Zhiquan Feng
Pattern Recognition | VOL. 126
Qingbei Guo, et. al.Qingbei Guo ... Zhiquan Feng
22 Jan 2022
Pattern Recognition | VOL. 126

Finding the Optimal Topology of an Approximating Neural Network
Kostadin Yotov ... Stoyan Cheresharov
Mathematics | VOL. 11
Kostadin Yotov, et. al.Kostadin Yotov ... Stoyan Cheresharov
01 Jan 2023
Mathematics | VOL. 11

EDCompress: Energy-Aware Model Compression for Dataflows.
Zhehui Wang ... Rick Siow Mong Goh
IEEE Transactions on Neural Networks and Learning Systems | VOL. PP
Zhehui Wang, et. al.Zhehui Wang ... Rick Siow Mong Goh
01 Jan 2024
IEEE Transactions on Neural Networks and Learning Systems | VOL. PP

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Impact of Embedded Deep Learning Optimizations for Inference in Wireless IoT Use Cases

Abstract

Talk to us

Similar Papers

More From: IEEE Internet of Things Magazine