Abstract
This paper proposes a background noise adaptive energy-efficient keywords recognition processor with Reusable DNN (RDNN) and reconfigurable architecture. To reduce power consumption while maintaining the recognition accuracy of different background noises, the SNR prediction module determines whether the computing mode is low power consumption mode (LPM) or high performance mode (HPM). In LPM, DNN-shift (shift-based deep neural network) is used to achieve high recognition accuracy in a low background noise environment; in HPM, DNN-8bit (8bit weighted deep neural network) is used to achieve low power consumption in a high background noise environment. And the two modes share most of the hardware, and approximate computing is introduced to further reduce power consumption. Evaluated under 22nm process technology, this work can support up to 10 keywords recognition with the power consumption of <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$11.2~\mu \text{W}$ </tex-math></inline-formula> for high background noise and <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$7.3~\mu \text{W}$ </tex-math></inline-formula> for low background noise.
Highlights
Keywords recognition [1]–[3] refers to the targeted recognition of some specific words in certain scenarios when the user performs intelligent voice interaction
2) A reconfigurable architecture based on the SNR prediction module is designed for the proposed Reusable deep neural network (DNN) (RDNN) to realize high recognition accuracy and low power consumption in different background noise scenarios
We have proposed a convolutional neural network architecture based on a voltage domain analog switching network
Summary
Keywords recognition [1]–[3] refers to the targeted recognition of some specific words in certain scenarios when the user performs intelligent voice interaction. The main indicators of keyword recognition include: power consumption, memory and recognition accuracy On this basis, many different methods have been proposed and used to detect specific vocabulary in speech. 1) A RDNN which consists of DNN-shift and DNN8bit with different quantization methods is proposed This RDNN can achieve high recognition accuracy and low power consumption under various background noise environments with a wide range of SNR from 0dB to clean. 2) A reconfigurable architecture based on the SNR prediction module is designed for the proposed RDNN to realize high recognition accuracy and low power consumption in different background noise scenarios.
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have