Abstract

Many applications, such as machine learning and sensor data analysis, are statistical in nature and can tolerate some inaccuracy in their computation. Approximate computing is a viable method to save energy and increase performance by controllably trading accuracy for energy savings. In this paper, we propose a tiered approximate floating point multiplier, called CFPU, which significantly reduces energy consumption and improves the performance of multiplication at a slight cost in accuracy. The floating point multiplication is approximated by replacing the costly mantissa multiplication step with lower-energy alternatives. Each operation is processed in one of three modes, depending on the accuracy requirements: a basic approximate mode, an intermediate approximate mode, or the exact hardware. We evaluate the efficiency of the proposed CFPU on a wide range of applications, including twelve general OpenCL benchmarks and three machine learning applications. Our results show that using the first CFPU approximation mode yields a $3.5\times$ energy-delay product (EDP) improvement over a GPU using traditional floating point units (FPUs), while keeping the average relative error below 10%. Adding the second mode further increases the EDP improvement to $4.1\times$ over an unmodified FPU, again for less than 10% error. In addition, the proposed CFPU achieves a $2.8\times$ EDP improvement for multiply operations compared to state-of-the-art approximate multipliers.
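
To make the mantissa-replacement idea concrete, the minimal C sketch below mimics the flavor of the basic approximate mode in software: the sign bits are XORed and the exponents are added exactly, while the mantissa multiplication is skipped entirely and one operand's mantissa is copied to the result. This is an illustrative assumption about one possible "lower-energy alternative", not the paper's hardware design; the function name approx_fmul and the fallback policy are ours, and the paper's per-operation tuning logic for choosing between modes is omitted.

#include <stdint.h>
#include <string.h>
#include <stdio.h>

/* Illustrative software sketch of a basic approximate float multiply:
 * the mantissa multiply is skipped, and one operand's mantissa is copied
 * to the result while signs and exponents are handled exactly. Names and
 * fallback policy are our assumptions, not the paper's hardware design. */
static float approx_fmul(float a, float b)
{
    uint32_t ua, ub;
    memcpy(&ua, &a, sizeof ua);   /* reinterpret IEEE 754 bit patterns */
    memcpy(&ub, &b, sizeof ub);

    uint32_t ea = (ua >> 23) & 0xFFu;
    uint32_t eb = (ub >> 23) & 0xFFu;
    if (ea == 0u || ea == 0xFFu || eb == 0u || eb == 0xFFu)
        return a * b;             /* zeros, denormals, inf, NaN: exact path */

    int32_t exp = (int32_t)ea + (int32_t)eb - 127;  /* add exponents, re-bias */
    if (exp <= 0 || exp >= 255)
        return a * b;             /* result outside normal range: exact path */

    uint32_t sign = (ua ^ ub) & 0x80000000u;  /* sign is computed exactly   */
    uint32_t man  = ub & 0x007FFFFFu;         /* copy b's mantissa: no multiply */

    uint32_t ur = sign | ((uint32_t)exp << 23) | man;
    float r;
    memcpy(&r, &ur, sizeof r);
    return r;
}

int main(void)
{
    printf("exact:  %f\n", 1.10f * 2.50f);             /* prints 2.750000 */
    printf("approx: %f\n", approx_fmul(1.10f, 2.50f)); /* prints 2.500000, ~9% error */
    return 0;
}

Because the copied mantissa simply drops the other operand's fractional contribution, the relative error of this sketch is bounded by $m_a/(1+m_a)$, where $m_a$ is the discarded fraction; a tiered design such as the one the abstract describes falls back to an intermediate mode or the exact hardware when the accuracy requirement would otherwise be violated.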
