Abstract

Multiply–accumulate (MAC) computations account for a large share of the operations in machine learning accelerators. A pipelined structure is usually adopted to improve performance by shortening the critical path. However, the additional flip-flops introduced by pipelining generally incur significant area and power overhead, and a large number of flip-flops are often required to satisfy the feedforward-cutset rule. Based on the observation that this rule can be relaxed in machine learning applications, we propose a pipelining method that selectively eliminates some of these flip-flops. Simulation results show that the proposed MAC unit achieves a 20% energy saving and a 20% area reduction compared with a conventional pipelined MAC.
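
To make the role of the feedforward-cutset registers concrete, the following is a minimal behavioral sketch (an illustrative assumption, not the paper's actual RTL or the proposed design): a two-stage pipelined MAC in which the multiply and accumulate stages are separated by a pipeline register, modeling the flip-flops placed on a feedforward cutset so that every input-to-output path crosses the register exactly once.

```python
# Minimal behavioral sketch of a conventional 2-stage pipelined MAC
# (illustrative only; not the paper's implementation).
# Stage 1 computes the product; stage 2 accumulates the product computed
# in the previous cycle. `product_reg` models the flip-flops inserted on
# the feedforward cutset between the two stages.

def pipelined_mac(pairs):
    """Accumulate a*b over `pairs` with a one-cycle pipeline register."""
    acc = 0
    product_reg = None  # pipeline flip-flops on the feedforward cutset
    for a, b in pairs:
        if product_reg is not None:
            acc += product_reg   # stage 2: accumulate last cycle's product
        product_reg = a * b      # stage 1: compute this cycle's product
    if product_reg is not None:
        acc += product_reg       # flush the final product out of the pipeline
    return acc

assert pipelined_mac([(1, 2), (3, 4), (5, 6)]) == 44
```

In hardware, each bit of `product_reg` corresponds to a flip-flop; the paper's method reduces area and energy by selectively removing some of these registers where machine learning workloads can tolerate the relaxed cutset.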
