Abstract
Object detection and analysis using deep neural networks (DNNs) pose significant challenges because of their computational and power requirements. Such computation is typically carried out on platforms such as central processing units (CPUs), graphics processing units (GPUs), application-specific integrated circuits (ASICs), and field-programmable gate arrays (FPGAs). However, building high-performance computing platforms remains a critical challenge for edge computing tasks such as object detection, where power and bandwidth budgets are low yet fast, energy-efficient solutions are required. System-on-chip (SoC) designs are a promising way to address these challenges. This study presents a power- and delay-optimized Multiply-Accumulate (MAC) unit architecture for DNNs and compares the parameters of 4-bit, 8-bit, 12-bit, and 16-bit MAC units. The MAC unit, which performs multiplication, addition, and accumulation, was designed in Vivado. The design is analyzed and simulated using the Vivado High-Level Synthesis (HLS) tool and subsequently deployed on the Zybo Evaluation and Development Kit. The proposed approach outperforms existing state-of-the-art designs in processing time and power across the different precisions.