Shiftry: RNN inference in 2KB of RAM

Aayan Kumar,Vivek Seshadri,Rahul Sharma

doi:10.1145/3428250

Abstract

Traditionally, IoT devices send collected sensor data to an intelligent cloud where machine learning (ML) inference happens. However, this course is rapidly changing and there is a recent trend to run ML on the edge IoT devices themselves. An intelligent edge is attractive because it saves network round trip (efficiency) and keeps user data at the source (privacy). However, the IoT devices are much more resource constrained than the cloud, which makes running ML on them challenging. Specifically, consider Arduino Uno, a commonly used board, that has 2KB of RAM and 32KB of read-only Flash memory. Although recent breakthroughs in ML have created novel recurrent neural network (RNN) models that provide good accuracy with KB-sized models, deploying them on tiny devices with such hard memory requirements has remained elusive. We provide, Shiftry, an automatic compiler from high-level floating-point ML models to fixed-point C-programs with 8-bit and 16-bit integers, which have significantly lower memory requirements. For this conversion, Shiftry uses a data-driven float-to-fixed procedure and a RAM management mechanism. These techniques enable us to provide first empirical evaluation of RNNs running on tiny edge devices. On simpler ML models that prior work could handle, Shiftry-generated code has lower latency and higher accuracy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Shiftry: RNN inference in 2KB of RAM

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ACM on Programming Languages

Lead the way for us

Journal: Proceedings of the ACM on Programming Languages	Publication Date: Nov 13, 2020
Citations: 12

Similar Papers

Adaptive Early Exit of Computation for Energy-Efficient and Low-Latency Machine Learning over IoT Networks
Eric Samikwa ... Torsten Braun
-
Eric Samikwa, et. al.Eric Samikwa ... Torsten Braun
08 Jan 2022
08 Jan 2022

A machine-learning-based approach for predicting the geomagnetic secular variation
Sho Sato ... Hiroaki Toh
-
Sho Sato, et. al.Sho Sato ... Hiroaki Toh
11 Mar 2024
11 Mar 2024

Compiling KB-sized machine learning models to tiny IoT devices
Sridhar Gopinath ... Rahul Sharma
-
Sridhar Gopinath, et. al.Sridhar Gopinath ... Rahul Sharma
08 Jun 2019
08 Jun 2019

A review of on-device machine learning for IoT: An energy perspective
Nazli Tekin ... Vehbi Cagri Gungor
Ad Hoc Networks | VOL. 153
Nazli Tekin, et. al.Nazli Tekin ... Vehbi Cagri Gungor
10 Nov 2023
Ad Hoc Networks | VOL. 153

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Shiftry: RNN inference in 2KB of RAM

Abstract

Talk to us

Similar Papers

More From: Proceedings of the ACM on Programming Languages