Data Reuse for Accelerated Approximate Warps

Daniel Peroni, Hamid Nejatollahi, Nikil Dutt, Mohsen Imani, Tajana Rosing

doi:10.1109/tcad.2020.2986128

Abstract

Many data-driven applications, including computer vision, machine learning, speech recognition, and medical diagnostics, show tolerance to computation error. These applications are often accelerated on GPUs, but the performance improvements come at the cost of high energy usage. In this article, we present DRAAW, an approximate computing technique capable of accelerating GPGPU applications at the warp level. In GPUs, warps are groups of threads that are issued together across multiple cores. The slowest thread dictates the pace of the warp, so DRAAW identifies these bottlenecks and avoids them during approximation. We alleviate computation costs by using an approximate lookup table that tracks recent operations and reuses their results to exploit temporal locality within applications. To improve neural network performance, we propose neuron-aware approximation, a technique that profiles operations within network layers and automatically configures DRAAW so that computations with more impact on the output accuracy are subject to less approximation. We evaluate our design by placing DRAAW within each core of an Nvidia Kepler architecture Titan. DRAAW improves throughput by up to $2.8\times$ and improves energy-delay product (EDP) by $5.6\times$ for six GPGPU applications while maintaining less than 5% output error. We show that neuron-aware approximation accelerates the inference of six neural networks by $2.9\times$ and improves EDP by $6.2\times$ with less than 1% impact on prediction accuracy.
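
The lookup-table idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' hardware design: the matching rule (clearing the low mantissa bits of float32 operands), the FIFO replacement policy, and the names `ApproxLookupTable`, `truncate_operand`, and `warp_can_skip` are all assumptions made for illustration. The sketch shows how a small table of recent operations can return a stored result for nearly identical operands, and why a warp only benefits when every one of its threads hits.

```python
import struct
from collections import OrderedDict


def truncate_operand(x: float, dropped_bits: int) -> int:
    """Return the float32 bit pattern of x with its lowest bits cleared.

    Assumption: "approximate" matching means ignoring low mantissa bits.
    """
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    mask = (0xFFFFFFFF << dropped_bits) & 0xFFFFFFFF
    return bits & mask


class ApproxLookupTable:
    """Small FIFO table of recent multiply results (hypothetical model)."""

    def __init__(self, capacity: int = 8, dropped_bits: int = 12):
        self.capacity = capacity
        self.dropped_bits = dropped_bits
        self.entries = OrderedDict()  # truncated operands -> stored result
        self.hits = 0
        self.misses = 0

    def _key(self, a: float, b: float):
        return (truncate_operand(a, self.dropped_bits),
                truncate_operand(b, self.dropped_bits))

    def contains(self, a: float, b: float) -> bool:
        return self._key(a, b) in self.entries

    def multiply(self, a: float, b: float) -> float:
        key = self._key(a, b)
        if key in self.entries:
            # Hit: reuse the earlier result instead of recomputing.
            self.hits += 1
            return self.entries[key]
        # Miss: compute exactly and remember the result.
        self.misses += 1
        result = a * b
        if len(self.entries) >= self.capacity:
            self.entries.popitem(last=False)  # evict the oldest entry
        self.entries[key] = result
        return result


def warp_can_skip(table: ApproxLookupTable, operands) -> bool:
    """True only if every thread's operands hit in the table.

    A single miss forces the exact computation, and the slowest thread
    sets the pace of the whole warp.
    """
    return all(table.contains(a, b) for a, b in operands)


table = ApproxLookupTable(capacity=2, dropped_bits=12)
exact = table.multiply(1.5, 2.0)      # miss, computed exactly
reused = table.multiply(1.5001, 2.0)  # hit, returns the stored result
```

In this sketch, `dropped_bits` trades accuracy for hit rate: clearing more bits makes more operand pairs match, at the price of a larger error in the reused result.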

Journal: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
Publication Date: Dec 1, 2020
Citations: 42
License type: publisher-specific-oa

Similar Papers

ARGA
Daniel Peroni ... Tajana Rosing
02 Jun 2019

Prediction of the importance of auxiliary traits using computational intelligence and machine learning: A simulation study.
Antônio Carlos Da Silva Júnior ... Isabela De Castro Sant’Anna
PloS one | VOL. 16
29 Nov 2021

Machine Learning Meets Databases
Stephan Günnemann
Datenbank-Spektrum | VOL. 17
31 Jan 2017

Gesture speak: Hands-Free Computer Control with Hand Gestures and Voice Commands
S Gopalakrishnan ... K Sankar
Indian Journal of Computer Science and Technology | VOL. -
23 May 2024