Evaluation of the smoothed group delay spectrum distance measure for speaker‐dependent speech recognition

Taizo Umezaki, Fumitada Itakura

doi:10.1002/ecjc.4430740401

Abstract

This paper first evaluates the smoothed group delay spectrum (SGDS) distance measure through speaker-dependent isolated-word speech recognition experiments. With speech recognition in real environments in mind, the experiments covered three cases: 1) the channels have different characteristics; 2) white noise is added to the input speech; and 3) telephone speech is used as the input. In all three cases, the recognition rate improves drastically compared with the traditional LPC cepstrum distance measure. Under noise with a segmental SN ratio of 20 dB, the recognition rate improved by 16 percent. Next, the distance measure is evaluated for the case where the FFT cepstrum is converted into the group delay spectrum. This method gives a better recognition rate than the conventional FFT cepstrum distance measure, but the result is approximately 3 percent worse than the SGDS measure, since the higher‐order FFT cepstrum coefficients have a larger variance along the time axis. Finally, the SGDS distance measure is evaluated in an isolated-word speech recognition system that uses monosyllables as the registered speech. The vowel recognition rate improves, which in turn improves the recognition rates for syllables and words by 2 percent or more on a relative scale.
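
The abstract does not state the formulas behind the measure, so the following is only a minimal sketch under stated assumptions. It uses the standard identity that, for a minimum-phase model with cepstral coefficients c_1..c_P, the group delay spectrum is the sum over n of n * c_n * cos(n * omega), which means a squared distance between group delay spectra can be computed as a weighted cepstral distance. The smoothing weight n**s * exp(-n**2 / (2 * tau**2)), the parameter names `s` and `tau`, and all function names are assumptions chosen for illustration; they are not taken from this abstract.

```python
import numpy as np


def group_delay_spectrum(cepstrum, n_freqs=128):
    """Group delay of a minimum-phase model on n_freqs points in [0, pi].

    `cepstrum` holds c_1..c_P (c_0 is excluded).
    Uses tau(omega) = sum_n n * c_n * cos(n * omega).
    """
    c = np.asarray(cepstrum, dtype=float)
    n = np.arange(1, len(c) + 1)
    omega = np.linspace(0.0, np.pi, n_freqs)
    return np.cos(np.outer(omega, n)) @ (n * c)


def smoothing_weights(order, s=1.0, tau=8.0):
    """Hypothetical cepstral weights: n**s times a Gaussian in n.

    s = 0 with a very large tau gives a plain (unweighted) cepstral
    distance; s = 1 with a very large tau gives the unsmoothed group
    delay weighting. These parameter values are illustrative only.
    """
    n = np.arange(1, order + 1, dtype=float)
    return n ** s * np.exp(-(n ** 2) / (2.0 * tau ** 2))


def weighted_cepstral_distance(c_a, c_b, weights):
    """Squared Euclidean distance between weighted cepstral vectors."""
    diff = (np.asarray(c_a, dtype=float) - np.asarray(c_b, dtype=float)) * weights
    return float(np.sum(diff ** 2))
```

In this sketch, the Gaussian factor suppresses the higher-order coefficients, which is consistent with the abstract's remark that higher-order FFT cepstrum coefficients vary more over time.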

Journal: Electronics and Communications in Japan (Part III: Fundamental Electronic Science)
Publication Date: Jan 1, 1991
Citations: 1

Similar Papers

Fractional Lower-order Statistics for Yangzhou Dialectal Speech Recognition
Huimin Lu ... Yujie Li
01 Jan 2015

Development of a Speech Recognition System Based on User Context Information in a Smart Home Environment
Jong-Hun Kim ... Jae-Ho Sim
The Journal of the Korea Contents Association | VOL. 8
28 Jan 2008

Audio Visual Technique for Enhancing the Isolated Word Speech Recognition System
...
International Journal of Advanced Research in Computer Science | VOL. 8
30 Apr 2017

Isolated Word Speech Recognition System Based On FPGA
Xiaohui Hu ... Weixing Zhou
Journal of Computers | VOL. 8
12 Jan 2013
