Hardware Platform-Aware Binarized Neural Network Model Optimization

Quang Hieu Vo,Batyrbek Alimkhanuly,Lokwon Kim,Faaiz Asim,Seunghyun Lee

doi:10.3390/app12031296

Abstract

Deep Neural Networks (DNNs) have shown superior accuracy at the expense of high memory and computation requirements. Optimizing DNN models regarding energy and hardware resource requirements is extremely important for applications with resource-constrained embedded environments. Although using binary neural networks (BNNs), one of the recent promising approaches, significantly reduces the design’s complexity, accuracy degradation is inevitable when reducing the precision of parameters and output activations. To balance between implementation cost and accuracy, in addition to proposing specialized hardware accelerators for corresponding specific network models, most recent software binary neural networks have been optimized based on generalized metrics, such as FLOPs or MAC operation requirements. However, with the wide range of hardware available today, independently evaluating software network structures is not good enough to determine the final network model for typical devices. In this paper, an architecture search algorithm based on estimating the hardware performance at the design time is proposed to achieve the best binary neural network models for hardware implementation on target platforms. With the XNOR-net used as a base architecture and target platforms, including Field Programmable Gate Array (FPGA), Graphic Processing Unit (GPU), and Resistive Random Access Memory (RRAM), the proposed algorithm shows its efficiency by giving more accurate estimation for the hardware performance at the design time than FLOPs or MAC operations.

Highlights

In recent years, deep learning has demonstrated incredible performance in diverse research areas with different tasks, such as classification and detection [1,2]
In addition to deploying optimal accelerators and optimizing based on accuracy and generalized metrics, such as FLOPs, MAC operations are a popular method applied for most neural network models
Only evaluating on software level is not enough to select the optimal model for hardware implementation

Summary

Introduction

Deep learning has demonstrated incredible performance in diverse research areas with different tasks, such as classification and detection [1,2]. To design a hardware platform-aware optimal BNN, we propose a new framework that can analyze and explore the ultimate software model based on the estimation of the hardware performance with optimal effort and an architecture search for the training period. We present a neural network search algorithm called Deepbit, which can explore optimal BNN models for target hardware platforms by using the binary search method and hardware cost estimation charts. Since the training is performed on GPU servers that are resourceful environments in terms of computing performance, power, and temperature, it is much favorable to increase the computation at the training time if it results in an efficient neural network for the targeted hardware.

Related Work

Basic Design Strategies

Proposed Architecture Search Solution

Hardware Cost Estimation

Architectural Search via Deepbit Method

Result

Optimal BNN Search

Estimating the Hardware Costs for Optimal Models

Analysis and Discussion

Findings

Future Work

Conclusions

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied Sciences	Publication Date: Jan 26, 2022
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Hardware Platform-Aware Binarized Neural Network Model Optimization

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

Improving Efficiency and Accuracy for Training and Inference of Hardware-aware Machine Learning Systems

-

12 Mar 2020
12 Mar 2020

FPGA-based acceleration for binary neural networks in edge computing
Jin-Yu Zhan ... Jun-Huan Yang
Journal of Electronic Science and Technology | VOL. 21
Jin-Yu Zhan, et. al.Jin-Yu Zhan ... Jun-Huan Yang
01 Jun 2023
Journal of Electronic Science and Technology | VOL. 21

A Deep Learning Accelerator Based on a Streaming Architecture for Binary Neural Networks
Quang Hieu Vo ... Lok-Won Kim
IEEE Access | VOL. 10
Quang Hieu Vo, et. al.Quang Hieu Vo ... Lok-Won Kim
01 Jan 2021
IEEE Access | VOL. 10

ReBNN: in-situ acceleration of binarized neural networks in ReRAM using complementary resistive cell
Linghao Song ... You Wu
CCF Transactions on High Performance Computing | VOL. 1
Linghao Song, et. al.Linghao Song ... You Wu
23 Oct 2019
CCF Transactions on High Performance Computing | VOL. 1

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Hardware Platform-Aware Binarized Neural Network Model Optimization

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences