Matrix sentence intelligibility prediction using an automatic speech recognition system

Marc René Schädler, Anna Warzybok, Sabine Hochmuth, Birger Kollmeier

doi:10.3109/14992027.2015.1061708

Abstract

Objective: The feasibility of predicting the outcome of the German matrix sentence test for different types of stationary background noise using an automatic speech recognition (ASR) system was studied.

Design: Speech reception thresholds (SRT) of 50% intelligibility were predicted in seven noise conditions. The ASR system used Mel-frequency cepstral coefficients as a front-end and employed whole-word Hidden Markov models on the back-end side. The ASR system was trained and tested with noisy matrix sentences on a broad range of signal-to-noise ratios.

Study sample: The ASR-based predictions were compared to data from the literature (Hochmuth et al, 2015) obtained with 10 native German listeners with normal hearing and predictions of the speech intelligibility index (SII).

Results: The ASR-based predictions showed a high and significant correlation (R² = 0.95, p < 0.001) with the empirical data across different noise conditions, outperforming the SII-based predictions which showed no correlation with the empirical data (R² = 0.00, p = 0.987).

Conclusions: The SRTs for the German matrix test for listeners with normal hearing in different stationary noise conditions could well be predicted based on the acoustical properties of the speech and noise signals. Minimum assumptions were made about human speech processing already incorporated in a reference-free ordinary ASR system.
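
As an illustration of what predicting an SRT of 50% intelligibility can involve, the following is a minimal sketch, not the authors' procedure. It assumes that word recognition scores of a recognizer are available at several signal-to-noise ratios and linearly interpolates the SNR at which the score crosses 50%. The function name `estimate_srt` and the example values are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def estimate_srt(
    snrs_db: Sequence[float],
    word_scores: Sequence[float],
    target: float = 0.5,
) -> float:
    """Return the SNR (dB) at which the recognition score crosses `target`.

    Assumption: the threshold is found by linear interpolation between the
    two neighbouring measurement points that bracket the target score.
    """
    if len(snrs_db) != len(word_scores):
        raise ValueError("snrs_db and word_scores must have the same length")

    # Sort the measurement points by ascending SNR.
    points = sorted(zip(snrs_db, word_scores))

    for (snr_lo, score_lo), (snr_hi, score_hi) in zip(points, points[1:]):
        if score_lo == target:
            return snr_lo
        if score_lo < target <= score_hi:
            # Fraction of the way from the lower to the upper point.
            frac = (target - score_lo) / (score_hi - score_lo)
            return snr_lo + frac * (snr_hi - snr_lo)

    raise ValueError("recognition scores never cross the target")


# Illustrative values only (not data from the study).
example_snrs = [-10.0, -8.0, -6.0, -4.0]
example_scores = [0.0, 0.25, 0.75, 1.0]
example_srt = estimate_srt(example_snrs, example_scores)
print(f"Estimated SRT: {example_srt:.1f} dB SNR")
```

In this sketch the threshold is read from a piecewise-linear curve; fitting a psychometric function to the scores would be an alternative, and the abstract does not state which approach was used.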

Journal: International Journal of Audiology
Publication Date: May 1, 2015
Citations: 54