Abstract

Binarized neural networks are well suited to FPGA accelerators: the fine-grained FPGA architecture allows custom operators for low-precision arithmetic, and the reduced memory requirements mean that all the network parameters can be stored in on-chip memory. Although good progress has been made in improving the accuracy of binarized networks, it can still be significantly lower than that of networks whose weights and activations have multi-bit precision. In this paper, we address this issue by adaptively choosing the number of frames used during inference, exploiting the high frame rates that binarized neural networks can achieve. We present a novel entropy-based adaptive filtering technique that improves accuracy by varying the system’s processing rate according to the entropy of the neural network output. We focus on real data captured with a standard camera rather than on standard datasets, which do not realistically represent the artifacts present in video streams. The overall design has been prototyped on the Avnet Zedboard and achieves 70.4% accuracy with a full processing pipeline from video capture to final classification output, 1.9 times higher than the baseline static-frame-rate system. The main feature of the system is that while the classification rate averages a constant 30 fps, the actual processing rate is dynamic and varies between 30 and 142 fps, adapting to the complexity of the data. This dynamic processing rate yields better efficiency than simply working at the full frame rate while still delivering high accuracy.
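
The following is a minimal sketch of the kind of entropy-driven rate adaptation described in the abstract: the Shannon entropy of the per-frame output p.m.f. decides how many frames per second to process. The function names, thresholds, and the linear mapping between entropy and frame rate are illustrative assumptions, not the paper's actual schemes (Schemes I–III).

```python
# Minimal sketch of entropy-driven rate adaptation, assuming a softmax-style
# output p.m.f. per frame. Thresholds and the linear mapping are placeholders.
import numpy as np

BASE_FPS = 30    # average classification rate reported in the paper
MAX_FPS = 142    # upper bound of the dynamic processing rate

def shannon_entropy(pmf, eps=1e-12):
    """Entropy (in bits) of the classifier's output distribution."""
    pmf = np.clip(np.asarray(pmf, dtype=np.float64), eps, 1.0)
    pmf = pmf / pmf.sum()
    return float(-np.sum(pmf * np.log2(pmf)))

def select_processing_rate(pmf, low_th=1.0, high_th=2.5):
    """Map output uncertainty onto a processing rate between BASE_FPS and MAX_FPS.

    Low entropy -> confident prediction, keep the base rate; high entropy ->
    uncertain prediction, process more frames so that temporal filtering can
    average out misclassifications.
    """
    h = shannon_entropy(pmf)
    if h <= low_th:
        return BASE_FPS
    if h >= high_th:
        return MAX_FPS
    frac = (h - low_th) / (high_th - low_th)
    return int(round(BASE_FPS + frac * (MAX_FPS - BASE_FPS)))

# Example with a 10-class output: confident vs. near-uniform prediction
print(select_processing_rate([0.91] + [0.01] * 9))   # low entropy -> 30 fps
print(select_processing_rate([0.1] * 10))            # high entropy -> 142 fps
```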

Highlights

  • Neural networks running on general-purpose CPUs or GPUs are a common solution for image recognition problems, yet these solutions tend to be power-hungry, or their performance falls below the requirements of many applications

  • This shows that entropy is more sensitive to sudden changes, because the entropy calculation is based only on the current p.m.f., whereas autocorrelation takes into account the temporal dependency among serial data at lag k and therefore exhibits a degree of memory effect (see the sketch after this list)

  • This paper has demonstrated how, by considering more frames, the accuracy of a binary-precision neural network can be improved in a real application scenario with data captured by a camera
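
The second highlight contrasts the two uncertainty measures. The sketch below, with an assumed confidence stream and lag value, illustrates why per-frame entropy reacts immediately to a sudden change while a lag-k autocorrelation, which carries memory of previous frames, does not.

```python
# Sketch, under assumed data, contrasting the two uncertainty measures:
# per-frame entropy uses only the current output p.m.f., while lag-k
# autocorrelation of the confidence stream depends on previous frames
# (the "memory effect" mentioned above).
import numpy as np

def shannon_entropy(pmf, eps=1e-12):
    pmf = np.clip(np.asarray(pmf, dtype=np.float64), eps, 1.0)
    pmf = pmf / pmf.sum()
    return float(-np.sum(pmf * np.log2(pmf)))

def lag_k_autocorr(x, k):
    """Sample autocorrelation of sequence x at lag k."""
    x = np.asarray(x, dtype=np.float64) - np.mean(x)
    denom = float(np.dot(x, x))
    return float(np.dot(x[:-k], x[k:])) / denom if denom > 0 else 0.0

# Top-class confidence stream with a sudden scene change at frame 50
conf = np.concatenate([np.full(50, 0.95), np.full(50, 0.55)])
pmfs = [[c] + [(1.0 - c) / 9.0] * 9 for c in conf]   # assumed 10-class p.m.f.s

entropy = [shannon_entropy(p) for p in pmfs]
print(entropy[49], entropy[50])        # entropy jumps at the very next frame

# Rolling lag-5 autocorrelation over 20-frame windows is zero on the constant
# stretches and deviates only while a window straddles the change, because the
# statistic still "remembers" the pre-change frames it contains.
rolling = [lag_k_autocorr(conf[i:i + 20], k=5) for i in range(0, 80, 5)]
print(np.round(rolling, 2))
```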

Summary

Introduction

Neural networks running on general-purpose CPUs or GPUs are a common solution for image recognition problems, yet these solutions tend to be power-hungry, or their performance falls below the requirements of many applications. Integer-precision neural networks deployed on FPGA accelerators have been shown to achieve very high performance per watt; however, accuracy can degrade significantly if quantization is taken to binary levels, making the networks more prone to prediction errors. To address this issue, we present a novel entropy-based adaptive filter, which is lightweight and modular. We show how the accuracy of a low-precision neural network working with real video data can be improved by increasing the processing rate. Our end goal is to boost the real accuracy of low-precision neural network systems and to use adaptive schemes that adjust energy consumption dynamically based on data complexity.
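
As one concrete illustration of how considering more frames can improve accuracy (the idea behind the window filter proposed later in the paper), the sketch below aggregates per-frame outputs over a short sliding window before committing to a label. The window size and the averaging rule are assumptions; the paper's actual filter may differ.

```python
# Minimal sliding-window filter sketch, assuming per-frame softmax-style
# outputs. Averaging the last N output p.m.f.s before taking the argmax lets
# occasional misclassifications from the binarized network be voted out.
from collections import deque

import numpy as np

class WindowFilter:
    def __init__(self, window_size=8):
        self.window = deque(maxlen=window_size)

    def update(self, pmf):
        """Add one frame's output p.m.f. and return the filtered class label."""
        self.window.append(np.asarray(pmf, dtype=np.float64))
        # Mean of the stored p.m.f.s acts as a soft vote over the window.
        return int(np.mean(self.window, axis=0).argmax())

# Usage: feed frames as they are classified; an isolated wrong prediction
# (second frame) does not flip the filtered label.
filt = WindowFilter(window_size=4)
for pmf in ([0.80, 0.10, 0.10],
            [0.20, 0.70, 0.10],
            [0.90, 0.05, 0.05],
            [0.85, 0.10, 0.05]):
    print(filt.update(pmf))
```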

Background and Related Work
Methodology
Proposed Window Filter
Baseline and Regions Definition
Window Filter Evaluation
Proposed Uncertainty Estimation Measures
Scheme I
Scheme II
Scheme III
Uncertainty Estimation Schemes Evaluation
Adaptive Filtering
Overall Accuracy and Performance Analysis
Energy Consumption of Various Setups
Accuracy Gain under Diverse Setups
Overall Performance Gain
Findings
Conclusions