Abstract

A cost-effective implementation of Convolutional Neural Networks on the mobile edge of the Internet of Things (IoT) requires smart optimizations to fit large models into memory-constrained cores. Reduction methods that jointly combine filter pruning and weight quantization have proven efficient at searching for the compression that ensures minimum model size without accuracy loss. However, other optimal configurations exist that stem from the memory constraint itself. The objective of this work is to assess such memory-bounded implementations and to show that most of them are centred on specific parameter settings that are difficult to implement on a low-power RISC core. Hence, the focus is on quantifying the distance to optimality of the closest implementations that can actually be deployed on hardware. The analysis is powered by a two-stage framework that efficiently explores the memory-accuracy space using a lightweight, hardware-conscious heuristic optimization. Results are collected from three realistic IoT tasks (Image Classification on CIFAR-10, Keyword Spotting on the Speech Commands Dataset, Facial Expression Recognition on Fer2013) run on RISC cores (ARM Cortex-M) with a few hundred KB of on-chip RAM.
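
As a rough illustration of the memory constraint at play, the sketch below estimates the weight footprint of a small ConvNet under a given filter-pruning ratio and quantization bit-width, and checks it against an on-chip RAM budget. It is a minimal sketch: the layer shapes, function names, and the uniform per-layer pruning model are illustrative assumptions, not the paper's actual framework.

```python
# Minimal sketch (hypothetical, not the paper's framework): estimate the
# weight-memory footprint of a ConvNet after filter pruning and weight
# quantization, and test it against an on-chip RAM budget.

def model_size_bytes(layers, prune_ratio, bits):
    """Approximate weight footprint in bytes.

    layers      : list of (c_in, c_out, kernel) tuples for the conv layers
    prune_ratio : fraction of output filters removed per layer, in [0, 1)
    bits        : weight bit-width after quantization (e.g. 8, 4, 2)
    """
    total_bits = 0
    prev_kept = None  # pruning a layer's filters also shrinks the next layer's input channels
    for c_in, c_out, kernel in layers:
        c_in_eff = prev_kept if prev_kept is not None else c_in
        kept_out = max(1, round(c_out * (1.0 - prune_ratio)))
        total_bits += c_in_eff * kept_out * kernel * kernel * bits
        prev_kept = kept_out
    return total_bits // 8

# Hypothetical CIFAR-10-style stack checked against a 512 KB RAM budget.
layers = [(3, 32, 3), (32, 64, 3), (64, 128, 3), (128, 256, 3)]
budget = 512 * 1024
for bits in (8, 4, 2):
    for prune in (0.0, 0.25, 0.5):
        size = model_size_bytes(layers, prune, bits)
        status = "fits" if size <= budget else "exceeds budget"
        print(f"bits={bits} prune={prune:.2f} -> {size / 1024:6.1f} KB ({status})")
```

In these terms, the memory-bounded configurations mentioned above correspond to (prune_ratio, bits) pairs whose footprint sits just under the budget; per the abstract, the settings the hardware can actually execute may lie at some distance from those optima.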

Highlights

  • Most IoT applications run Deep Convolutional Neural Networks (ConvNets hereafter) in the cloud, public or private depending on the context

  • The analysis aims to assess the optimality of hardware-compliant implementations and quantify their distance from theoretical solutions

  • We introduce the ConvNets adopted as test-cases, together with the datasets used for the training stage and the evaluation

Introduction

Most IoT applications run Deep Convolutional Neural Networks (ConvNets hereafter) in the cloud, public or private depending on the context. The focus of this work is on low-cost IoT applications (e.g. that described in [2]) where form factor and energy budget are the main concerns. In such cases, the software stack is developed over off-the-shelf embedded platforms powered by tiny RISC cores. In their early years, ConvNets were mainly optimized to improve accuracy; this led to an exponential increase in size and complexity. The rise of edge computing brought memory and storage capacity into the loop. During this fast evolution, several optimization methods have been introduced and tested on different architectures; a thorough overview is reported in [3]. This section gives a critical review of prior art, motivating the choices implemented in this work.
