Low-Energy Voice Activity Detection via Energy-Quality Scaling From Data Conversion to Machine Learning

Jinq Horng Teo, Massimo Alioto, Shuai Cheng

doi:10.1109/tcsi.2019.2960843

Abstract

This work investigates voice activity detection (VAD) systems with system-level energy-quality (EQ) scaling. In contrast to prior single-knob EQ scaling, multiple EQ knobs are selectively inserted throughout the signal chain, from end to end, and are dynamically co-optimized to minimize energy for a given quality target. The analysis shows that system-level EQ optimization provides several benefits and has interesting implications for the performance of machine learning-based classification, as exemplified by decision trees in this work. First, it makes quality degradation more graceful than single-knob scaling, allowing more aggressive energy reduction under a given quality target while retaining the ability to operate at full quality. Second, proper system-level EQ optimization improves fitting in machine learning-based systems (e.g., decision tree-based ones), suppressing both underfitting and overfitting. The analysis also shows that context-specific retraining significantly improves quality and resolves fitting issues, especially at low input SNR. Measurements on a 28 nm test chip show that system-level EQ scaling can reduce energy by up to 3.5X at 2% accuracy degradation in 10-dB noise, compared to full quality. An iso-technology comparison shows that the minimum energy of 51.9 nJ/frame is 1.9-74.4X lower than prior art at comparable speech/non-speech hit rates.
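The idea of co-optimizing several EQ knobs to minimize energy under a quality target can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the knob names (`adc_bits`, `tree_depth`), the additive energy and accuracy-loss model, and all numbers are hypothetical and are not taken from the paper or its measurements. The sketch only shows why searching over several knobs jointly can reach a lower energy than scaling one knob alone at the same accuracy target.

```python
from itertools import product

# Hypothetical knobs and toy numbers, chosen only for illustration.
# Each setting maps to (energy in arbitrary units, accuracy loss in percentage points).
KNOBS = {
    "adc_bits": {8: (4.0, 0.0), 6: (2.5, 0.5), 4: (1.5, 2.5)},
    "tree_depth": {8: (3.0, 0.0), 6: (2.0, 0.5), 4: (1.0, 3.0)},
}
FULL_QUALITY_ACCURACY = 95.0  # assumed accuracy (%) with every knob at its highest setting


def evaluate(config):
    """Return (energy, accuracy) for {knob: setting} under the toy additive model."""
    energy = sum(KNOBS[k][v][0] for k, v in config.items())
    loss = sum(KNOBS[k][v][1] for k, v in config.items())
    return energy, FULL_QUALITY_ACCURACY - loss


def min_energy_config(target_accuracy, free_knobs=None):
    """Exhaustively search the free knobs; all other knobs stay at their highest setting.

    Returns (energy, accuracy, config) for the lowest-energy configuration that
    meets target_accuracy, or None if no configuration meets it.
    """
    names = list(KNOBS)
    free = set(names if free_knobs is None else free_knobs)
    choices = [list(KNOBS[k]) if k in free else [max(KNOBS[k])] for k in names]
    best = None
    for settings in product(*choices):
        config = dict(zip(names, settings))
        energy, accuracy = evaluate(config)
        if accuracy >= target_accuracy and (best is None or energy < best[0]):
            best = (energy, accuracy, config)
    return best


if __name__ == "__main__":
    target = 93.0  # tolerate 2 percentage points of degradation in this toy model
    print("single knob:", min_energy_config(target, free_knobs=["adc_bits"]))
    print("all knobs:  ", min_energy_config(target))
```

In this toy model, scaling a single knob hits the accuracy target early because its loss grows quickly at aggressive settings, whereas spreading mild degradation across both knobs meets the same target at lower total energy. Setting the target to the full-quality accuracy returns the all-maximum configuration, mirroring the ability to fall back to full quality.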


Journal: IEEE Transactions on Circuits and Systems I: Regular Papers
Publication Date: Apr 1, 2020
Citations: 26

Similar Papers

Energy-Quality Scalable Analog-to-Digital Conversion and Machine Learning Engine in a 51.9 nJ/frame Voice Activity Detector
Jinq Horng Teo ... Shuai Cheng
01 Nov 2019

Energy-Quality Scalable Integrated Circuits and Systems: Continuing Energy Scaling in the Twilight of Moore’s Law
Massimo Alioto ... Vivek De
IEEE Journal on Emerging and Selected Topics in Circuits and Systems | Vol. 8
01 Dec 2018

A 1μW voice activity detector using analog feature extraction and digital deep neural network
Minhao Yang ... Mingoo Seok
01 Feb 2018

Energy-Quality Scalable Integrated Systems - Preserving Energy Downscaling in the Decade Ahead
Massimo Alioto
01 Mar 2019
