A spike-based computational model for noise robust vowel classification

Ismail Uysal,John G Harris,Harsha Sathyendra

doi:10.1121/1.4786421

Abstract

The human ability to recognize speech drastically outperforms that of commercial ASR systems especially in noisy environments. Presently, there is limited knowledge of the auditory system dynamics, however it is known that coding and processing of information is carried out via action potentials. This research aims to better understand the coding mechanisms along the auditory pathway, while devising a noise robust system for speech recognition. A biologically plausible algorithm for vowel classification is proposed, which solely uses spikes for both the feature extraction and the classification stages. The algorithm uses an improved and adaptive model of the inner-hair cell [Sumner et al., J. Acoust. Soc. Am. 113, 893–901 (2003)] to generate spike trains at different characteristic frequencies. The synchrony among the hair cells is used as a noise robust means for feature extraction. Detected features are then classified using a spike-based rank order coder, which uses the spike arrival times to the postsynaptic neuron to encode information [Delorme and Thorpe, Neural Networks. 14, 795–803 (2001)]. Experiments on a noisy vowel dataset (5 dB SNR) show an average of 15% increase in the recognition rate for the prototype system when compared to a nearest-neighbor classifier employing Mel frequency cepstral coefficients.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A spike-based computational model for noise robust vowel classification

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America

Lead the way for us

Similar Papers

How Hearing Happens
A.J Hudspeth
Neuron | VOL. 19
A.J HudspethA.J Hudspeth
01 Nov 1997
Neuron | VOL. 19

Editor's evaluation: The Cl--channel TMEM16A is involved in the generation of cochlear Ca2+ waves and promotes the refinement of auditory brainstem networks in mice
Marla B Feller
-
Marla B FellerMarla B Feller
07 Oct 2021
07 Oct 2021

Decision letter: The Cl--channel TMEM16A is involved in the generation of cochlear Ca2+ waves and promotes the refinement of auditory brainstem networks in mice
Marla B Feller ... Andrew J King
-
Marla B Feller, et. al.Marla B Feller ... Andrew J King
07 Oct 2021
07 Oct 2021

Is there an unmet medical need for improved hearing restoration?
Bettina Julia Wolf ... Tobias Moser
EMBO Molecular Medicine | VOL. 14
Bettina Julia Wolf, et. al.Bettina Julia Wolf ... Tobias Moser
14 Jul 2022
EMBO Molecular Medicine | VOL. 14

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A spike-based computational model for noise robust vowel classification

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America