A C Code Generator for Fast Inference and Simple Deployment of Convolutional Neural Networks on Resource Constrained Systems

Oliver Urbann,Arne Moos,Soren Kerner,Simon Camphausen,Maximilian Otten,Ingmar Schwarz

doi:10.1109/iemtronics51293.2020.9216395

Abstract

Inference of Convolutional Neural Networks in time critical applications usually requires a GPU. In robotics or embedded devices these are often not available due to energy, space and cost constraints. Furthermore, installation of a deep learning framework or even a native compiler on the target platform is not possible. This paper presents a neural network code generator (NNCG) that generates from a trained CNN a plain ANSI C code file that encapsulates the inference in single a function. It can easily be included in existing projects and due to lack of dependencies, cross compilation is usually possible. Additionally, the code generation is optimized based on the known trained CNN and target platform following four design principles. The system is evaluated utilizing small CNN designed for this application. Compared to TensorFlow XLA and Glow speed-ups of up to 11.81 can be shown and even GPUs are outperformed regarding latency.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A C Code Generator for Fast Inference and Simple Deployment of Convolutional Neural Networks on Resource Constrained Systems

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Highly Efficient 8-bit Low Precision Inference of Convolutional Neural Networks with IntelCaffe
Jiong Gong ... Evarist Fomenko
-
Jiong Gong, et. al.Jiong Gong ... Evarist Fomenko
20 Jun 2018
20 Jun 2018

Performance Analysis of a Phase-Change Memory System on Various CNN Inference Workloads
Jihoon Jang ... Hyokeun Lee
-
Jihoon Jang, et. al.Jihoon Jang ... Hyokeun Lee
19 Oct 2022
19 Oct 2022

PICO: Pipeline Inference Framework for Versatile CNNs on Diverse Mobile Devices
Xiang Yang ... Haifeng Sun
IEEE Transactions on Mobile Computing | VOL. 23
Xiang Yang, et. al.Xiang Yang ... Haifeng Sun
01 Apr 2024
IEEE Transactions on Mobile Computing | VOL. 23

An Efficient Task Assignment Framework to Accelerate DPU-Based Convolutional Neural Network Inference on FPGAs
Jiang Zhu ... Jianqi Li
IEEE Access | VOL. 8
Jiang Zhu, et. al.Jiang Zhu ... Jianqi Li
01 Jan 2020
IEEE Access | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A C Code Generator for Fast Inference and Simple Deployment of Convolutional Neural Networks on Resource Constrained Systems

Abstract

Talk to us

Similar Papers