A distance measure for speech recognition based on an FM‐neuron model

Kiyoaki Aikawa,Sadaoki Furui

doi:10.1002/ecjc.4430741210

Abstract

AbstractThis paper discusses the speech recognition based on the time course of the local peak of the spectrum such as the formant, which has been considered important in the phoneme perception. A measure for the dynamical behavior of the spectrum is proposed based on the functional model of the FM‐neuron which is shown to exist in auditory physiology.First, the FM‐neuron is modeled as a time‐frequency filter for the spectral time‐series which responds only to the shift of the local peak frequency of the spectrum with the discrimination function for the shift direction. Then the measure to represent the difference of the output from the FM‐neuron model is derived based on the cepstral expansion of the spectrum. The measure is called the spectral movement similarity.It is shown that the spectral movement similarity on the auditory nonlinear frequency axis can be realized equivalently by the frequency weighting. A spoken word recognition experiment is conducted employing the dynamic time warping (DTW) using the spectral movement similarity. It is shown also that the recognition error is reduced greatly by combining the proposed measure and the traditional spectral distance compared to the case where only the traditional spectral distance is used. This improvement is more remarkable when the cepstral distance is used as the spectral distance, with the recognition error being reduced to one‐fourth.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A distance measure for speech recognition based on an FM‐neuron model

Abstract

Talk to us

Similar Papers

More From: Electronics and Communications in Japan (Part III: Fundamental Electronic Science)

Lead the way for us

Similar Papers

A novel quantitative EEG injury measure of global cerebral ischemia
R.G Geocadin ... N.V Thakor
Clinical Neurophysiology | VOL. 111
R.G Geocadin, et. al.R.G Geocadin ... N.V Thakor
28 Sep 2000
Clinical Neurophysiology | VOL. 111

A weighted cepstral distance measure for speech recognition
Y Tohkura
IEEE Transactions on Acoustics, Speech, and Signal Processing | VOL. 35
Y TohkuraY Tohkura
01 Oct 1987
IEEE Transactions on Acoustics, Speech, and Signal Processing | VOL. 35

The spectral similarity scale and its application to the classification of hyperspectral remote sensing data
J.N Sweet
-
J.N SweetJ.N Sweet
27 Oct 2003
27 Oct 2003

A weighted cepstral distance measure for speech recognition
Y Tohkura
-
Y TohkuraY Tohkura
01 Apr 1986
01 Apr 1986

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A distance measure for speech recognition based on an FM‐neuron model

Abstract

Talk to us

Similar Papers

More From: Electronics and Communications in Japan (Part III: Fundamental Electronic Science)