Real-time Hardware Feature Extraction with Embedded Signal Enhancement for Automatic Speech Recognition

Vinh Vu,James Whittington,John Devli

doi:10.5772/17718

Abstract

The concept of using speech for communicating with computers and other machines has been the vision of humans for decades. User input via speech promises overwhelming advantages compared with standard input/output peripherals, such as, mouse, keyboard, and buttons. To make this vision a reality, considerable effort and investment into automatic speech recognition (ASR) research has been conducted for over six decades. While current speech recognition systems perform very well in benign environments, their performance is rather limited inmany real-world settings. One of the main degrading factors in these systems is background noise collected along with the wanted speech. There are a wide range of possible uncorrelated noise sources. They are generally short lived and non-stationary. For example in the automotive environments, noise sources can be road noise, engine noise, or passing vehicles that compete with the speech. Noise can also be continuous, such as, wind noise, particularly from an open window, or noise from a ventilation or air conditioning unit. To make speech recognition systems more robust, there are a number of methods being investigated. These include the use of robust feature extraction and recognition algorithms as well as speech enhancement. Enhancement techniques aim to remove (or at least reduce) the levels of noise present in the speech signals, allowing clean speech models to be utilised in the recognition stage. This is a popular approach as little-or-no prior knowledge of the operating environment is required for improvements in recognition accuracy. While many ASR and enhancement algorithms or models have been proposed, an issue of how to implement them efficiently still remains. Many software implementations of the algorithms exist, but they are limited in application as they require relatively powerful general purpose processors. To achieve a real-time design with both low-cost and high performance, a dedicated hardware implementation is necessary. This chapter presents the design of a Real-time Hardware Feature Extraction System with Embedded Signal Enhancement for Automatic Speech Recognition appropriate for implementation in low-cost Field Programmable Gate Array (FPGA) hardware. While suitable for many other applications, the design inspiration was for automotive applications, requiring real-time, low-cost hardware without sacrificing performance. Main components of this design are: an efficient implementation of the Discrete Fourier Transform (DFT), speech enhancement, and Mel-Frequency Cepstrum Coefficients (MFCC) feature extraction. 2

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Real-time Hardware Feature Extraction with Embedded Signal Enhancement for Automatic Speech Recognition

Abstract

Talk to us

Similar Papers

Lead the way for us

Publication Date: Jun 23, 2011
Citations: 7	License type: cc-by-nc-sa

Similar Papers

Combined speech enhancement and auditory modelling for robust distributed speech recognition
Ronan Flynn ... Edward Jones
Speech Communication | VOL. 50
Ronan Flynn, et. al.Ronan Flynn ... Edward Jones
20 May 2008
Speech Communication | VOL. 50

Speech Enhancement System for Automatic Speech Recognition in Automotive Environment
Gokul G Nair ... C Santhosh Kumar
-
Gokul G Nair, et. al.Gokul G Nair ... C Santhosh Kumar
06 Jul 2021
06 Jul 2021

An FPGA-Based Embedded Robust Speech Recognition System Designed by Combining Empirical Mode Decomposition and a Genetic Algorithm
Shing-Tai Pan ... Xu-Yu Li
IEEE Transactions on Instrumentation and Measurement | VOL. 61
Shing-Tai Pan, et. al.Shing-Tai Pan ... Xu-Yu Li
01 Sep 2012
IEEE Transactions on Instrumentation and Measurement | VOL. 61

FPGA-Based Hardware Accelerator for Feature Extraction in Automatic Speech Recognition
Chang Choo ... Il-Young Moon
Journal of information and communication convergence engineering | VOL. 13
Chang Choo, et. al.Chang Choo ... Il-Young Moon
28 Aug 2015
Journal of information and communication convergence engineering | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Real-time Hardware Feature Extraction with Embedded Signal Enhancement for Automatic Speech Recognition

Abstract

Talk to us

Similar Papers