Recognition of continuously spoken letters by listeners and spectrogram readers

Nancy A Daly,Victor W Zue

doi:10.1121/1.2025410

Abstract

Because of acoustic similarities between some letters of the alphabet, automatic recognition of continuously spoken letters is a difficult task. The goal of this study is to determine and compare how well listeners and spectrogram readers can recognize continuously spoken letter strings from multiple speakers. The interest in spectrogram reading results is motivated by the belief that this procedure may help to identify acoustic attributes and decision strategies that are useful for system implementation. Listening and spectrogram reading tests involving eight listeners and six spectrogram readers, respectively, were conducted using a corpus of 1000 wordlike strings designed to minimize the use of lexical knowledge. Results show that listeners' performance was better than readers' (98.4% vs 91.0%). In both experiments, string lengths were determined very accurately (98.1% and 96.2%), presumably due to the large number of glottal stops inserted at letter boundaries to facilitate segmentation. Most of the errors were due to substitution of one letter for another (68% and 92%), and they generally fall into two categories. Asymmetric errors can often be attributed to subjects' disregard for contextual influence, whereas symmetric errors are largely due to acoustic similarities between certain letter pairs. Subsequent acoustic study of four of the most confusable letter pairs has resulted in the identification of a number of distinguishing acoustic attributes. Using these attributes, overall recognition performance better than that of the readers was achieved. [Work supported by NSF and DARPA under contract N00014-82-K-0727, monitored through the Office of Naval Research.]

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Recognition of continuously spoken letters by listeners and spectrogram readers

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America

Lead the way for us

Similar Papers

A knowledge-based system for stop consonant identification based on speech spectrogram reading
Lori F Lamel
Computer Speech & Language | VOL. 7
Lori F LamelLori F Lamel
01 Apr 1993
Computer Speech & Language | VOL. 7

Adaptively truncated maximum likelihood regression with asymmetric errors
Alfio Marazzi ... Victor J Yohai
Journal of Statistical Planning and Inference | VOL. 122
Alfio Marazzi, et. al.Alfio Marazzi ... Victor J Yohai
06 Sep 2003
Journal of Statistical Planning and Inference | VOL. 122

КОДЫ БЕРГЕРА В СХЕМАХ ВСТРОЕННОГО КОНТРОЛЯ, РЕАЛИЗОВАННЫХ НА ОСНОВЕ МЕТОДА ЛОГИЧЕСКОГО ДОПОЛНЕНИЯ
D.V Efanov ... M.V Zueva
Informatika i sistemy upravleniya | VOL. -
D.V Efanov, et. al.D.V Efanov ... M.V Zueva
01 Jan 2020
Informatika i sistemy upravleniya | VOL. -

A class of M-ary asymmetric symbol error correcting codes for data entry devices
H Kaneko ... E Fujiwara
IEEE Transactions on Computers | VOL. 53
H Kaneko, et. al.H Kaneko ... E Fujiwara
01 Feb 2004
IEEE Transactions on Computers | VOL. 53

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Recognition of continuously spoken letters by listeners and spectrogram readers

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America