Human factor cepstral coefficients

Mark D Skowronski,John G Harris

doi:10.1121/1.4779137

Abstract

Automatic speech recognition (ASR) is an emerging field with the goal of creating a more natural man/machine interface. The single largest obstacle to widespread use of ASR technology is robustness to noise. Since human speech recognition greatly outperforms current ASR systems in noisy environments, ASR systems seek to improve noise robustness by drawing on biological inspiration. Most ASR front ends employ mel frequency cepstral coefficients (mfcc), a filter bank-based algorithm whose filters are spaced on a linear-log frequency scale. Although the filter center frequencies follow a perceptually motivated frequency scale, filter bandwidth is set by filter spacing rather than by biological motivation. This coupling of filter bandwidth to other filter bank parameters (frequency range, number of filters) has led to variations of the original algorithm with different filter bandwidths. In this work, a novel extension to mfcc is introduced which decouples filter bandwidth from the rest of the filter bank parameters by employing the relationship between filter center frequency and critical bandwidth of the human auditory system. The new algorithm, called human factor cepstral coefficients (hfcc), is shown to outperform the original mfcc and two popular variations in several ASR experiments with several noise sources.
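The coupling described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the common mel formula 2595 log10(1 + f/700), triangular filters that span from the previous center frequency to the next, and, as a stand-in for the "critical bandwidth" relationship (which the abstract does not specify), the Moore and Glasberg equivalent rectangular bandwidth approximation. All function names are hypothetical.

```python
import math

def hz_to_mel(f_hz):
    # Common mel-scale formula (an assumption; the abstract names no formula).
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_centers(f_min, f_max, n_filters):
    """Center frequencies equally spaced on the mel scale, strictly inside (f_min, f_max)."""
    m_lo, m_hi = hz_to_mel(f_min), hz_to_mel(f_max)
    step = (m_hi - m_lo) / (n_filters + 1)
    return [mel_to_hz(m_lo + step * (i + 1)) for i in range(n_filters)]

def coupled_bandwidths(f_min, f_max, n_filters):
    """Spacing-coupled widths: each triangle runs from the previous center to the next,
    so its base width depends on the frequency range and the number of filters."""
    edges = [f_min] + mel_centers(f_min, f_max, n_filters) + [f_max]
    return [edges[i + 2] - edges[i] for i in range(n_filters)]

def erb_bandwidth(fc_hz):
    """Moore and Glasberg ERB approximation in Hz (center frequency converted to kHz).
    Used here as an assumed stand-in for the critical-bandwidth relationship."""
    f_khz = fc_hz / 1000.0
    return 6.23 * f_khz ** 2 + 93.39 * f_khz + 28.52

if __name__ == "__main__":
    for n in (20, 40):
        centers = mel_centers(0.0, 4000.0, n)
        widths = coupled_bandwidths(0.0, 4000.0, n)
        # The coupled width changes with n; the ERB width depends only on the center frequency.
        print(n, round(centers[0], 1), round(widths[0], 1), round(erb_bandwidth(centers[0]), 1))
```

In this sketch, changing the number of filters changes every spacing-coupled width, while the perceptually derived width is a function of center frequency alone, which is the decoupling the abstract refers to.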

Journal: The Journal of the Acoustical Society of America
Publication Date: Oct 25, 2002
Citations: 28

Similar Papers

Vector Taylor series based HMM adaptation for generalized cepstrum in noisy environment
Soonho Baek ... Hong-Goo Kang
01 Dec 2013

A Robust Speech Recognition System for Communication Robots in Noisy Environments
Carlos Toshinori Ishi ... Takatoshi Jitsuhiro
IEEE Transactions on Robotics | VOL. 24
01 Jun 2008

Performance evaluation of Hindi speech recognition system using optimized filterbanks
Mohit Dua ... Mantosh Biswas
Engineering Science and Technology, an International Journal | VOL. 21
16 Apr 2018

Performance Analysis of various Front-end and Back End Amalgamations for Noise-robust DNN-based ASR
Mohit Dua ... Vinam Agrawal
Recent Advances in Computer Science and Communications | VOL. 14
01 Dec 2021
