Human phoneme recognition depending on speech-intrinsic variability

Bernd T Meyer, Thorsten Wesker, Birger Kollmeier, Thomas Brand, Tim Jürgens

doi:10.1121/1.3493450

Abstract

The influence of different sources of speech-intrinsic variation (speaking rate, effort, style and dialect or accent) on human speech perception was investigated. In listening experiments with 16 listeners, confusions of consonant-vowel-consonant (CVC) and vowel-consonant-vowel (VCV) sounds in speech-weighted noise were analyzed. Experiments were based on the OLLO logatome speech database, which was designed for a man-machine comparison. It contains utterances spoken by 50 speakers from five dialect/accent regions and covers several intrinsic variations. By comparing results depending on intrinsic and extrinsic variations (i.e., different levels of masking noise), the degradation induced by variabilities can be expressed in terms of the SNR. The spectral level distance between the respective speech segment and the long-term spectrum of the masking noise was found to be a good predictor for recognition rates, while phoneme confusions were influenced by the distance to spectrally close phonemes. An analysis based on transmitted information of articulatory features showed that voicing and manner of articulation are comparatively robust cues in the presence of intrinsic variations, whereas the coding of place is more degraded. The database and detailed results have been made available for comparisons between human speech recognition (HSR) and automatic speech recognizers (ASR).
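
The abstract does not spell out how the transmitted information of articulatory features was computed. As a rough illustration only, the sketch below treats it as the mutual information between presented and responded categories, estimated from a confusion matrix after pooling phonemes into feature classes. The function names, the four-consonant label set, the voicing map, and all counts are hypothetical and are not taken from the paper or the OLLO database.

```python
import math


def transmitted_information(confusions):
    """Mutual information in bits between stimulus (rows) and response (columns)."""
    total = sum(sum(row) for row in confusions)
    n_cols = len(confusions[0])
    row_p = [sum(row) / total for row in confusions]
    col_p = [sum(row[j] for row in confusions) / total for j in range(n_cols)]
    info = 0.0
    for i, row in enumerate(confusions):
        for j, count in enumerate(row):
            if count > 0:  # empty cells contribute nothing
                p = count / total
                info += p * math.log2(p / (row_p[i] * col_p[j]))
    return info


def relative_transmitted_information(confusions):
    """Transmitted information divided by the stimulus entropy (0 to 1)."""
    total = sum(sum(row) for row in confusions)
    row_p = [sum(row) / total for row in confusions]
    entropy = -sum(p * math.log2(p) for p in row_p if p > 0)
    return transmitted_information(confusions) / entropy if entropy > 0 else 0.0


def collapse_by_feature(confusions, labels, feature_of):
    """Pool a phoneme confusion matrix into classes of one articulatory feature."""
    classes = sorted(set(feature_of[label] for label in labels))
    index = {c: k for k, c in enumerate(classes)}
    pooled = [[0] * len(classes) for _ in classes]
    for i, presented in enumerate(labels):
        for j, responded in enumerate(labels):
            pooled[index[feature_of[presented]]][index[feature_of[responded]]] += confusions[i][j]
    return pooled


# Hypothetical counts for illustration only (rows: presented, columns: responded).
labels = ["p", "b", "t", "d"]
counts = [
    [8, 1, 1, 0],
    [1, 8, 0, 1],
    [1, 0, 8, 1],
    [0, 1, 1, 8],
]
voicing = {"p": "unvoiced", "b": "voiced", "t": "unvoiced", "d": "voiced"}

voicing_matrix = collapse_by_feature(counts, labels, voicing)
print(voicing_matrix)
print(relative_transmitted_information(voicing_matrix))
```

Pooling before computing the measure means that a confusion within the same feature class (here, "p" heard as "t") does not count against the feature, which is what allows voicing, manner, and place to be compared separately.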

Journal: The Journal of the Acoustical Society of America
Publication Date: Nov 1, 2010
Citations: 30

Similar Papers

Phoneme confusions in human and automatic speech recognition
Bernd T Meyer ... Birger Kollmeier
27 Aug 2007

Reaching over the gap: A review of efforts to link human and automatic speech recognition research
Odette Scharenborg
Speech Communication | VOL. 49
03 Feb 2007

Comparing human and automatic speech recognition in simple and complex acoustic scenes
Constantin Spille ... Bernd T Meyer
Computer Speech & Language | VOL. 52
14 Apr 2018

Robustness of spectro-temporal features against intrinsic and extrinsic variations in automatic speech recognition
Bernd T Meyer ... Birger Kollmeier
Speech Communication | VOL. 53
24 Jul 2010
