Development of Speech Recognition Systems in Emergency Call Centers

Alakbar Valizada,Samir Rustamov,Natavan Akhundova

doi:10.3390/sym13040634

Abstract

In this paper, various methodologies of acoustic and language models, as well as labeling methods for automatic speech recognition for spoken dialogues in emergency call centers were investigated and comparatively analyzed. Because of the fact that dialogue speech in call centers has specific context and noisy, emotional environments, available speech recognition systems show poor performance. Therefore, in order to accurately recognize dialogue speeches, the main modules of speech recognition systems—language models and acoustic training methodologies—as well as symmetric data labeling approaches have been investigated and analyzed. To find an effective acoustic model for dialogue data, different types of Gaussian Mixture Model/Hidden Markov Model (GMM/HMM) and Deep Neural Network/Hidden Markov Model (DNN/HMM) methodologies were trained and compared. Additionally, effective language models for dialogue systems were defined based on extrinsic and intrinsic methods. Lastly, our suggested data labeling approaches with spelling correction are compared with common labeling methods resulting in outperforming the other methods with a notable percentage. Based on the results of the experiments, we determined that DNN/HMM for an acoustic model, trigram with Kneser–Ney discounting for a language model and using spelling correction before training data for a labeling method are effective configurations for dialogue speech recognition in emergency call centers. It should be noted that this research was conducted with two different types of datasets collected from emergency calls: the Dialogue dataset (27 h), which encapsulates call agents’ speech, and the Summary dataset (53 h), which contains voiced summaries of those dialogues describing emergency cases. Even though the speech taken from the emergency call center is in the Azerbaijani language, which belongs to the Turkic group of languages, our approaches are not tightly connected to specific language features. Hence, it is anticipated that suggested approaches can be applied to the other languages of the same group.

Highlights

IntroductionThe job of call center agents can be overwhelming considering that they have to both talk on the phone and log the case information at the same time
This paper proposed an investigation of different models and methods for Automatic Speech Recognition (ASR) in emergency call centers especially for improving the quality and performance of ASR for the Azerbaijani language
We showed how the suggested data labeling approach with spelling correction result in outperforming other methods by a notable percentage

Summary

Introduction

The job of call center agents can be overwhelming considering that they have to both talk on the phone and log the case information at the same time. There are many applications designed to ease the job for call center agents. They can be connected via internet telephony, can include interactive voice response and other technologies to ensure a smooth experience. New opportunities arose when Artificial Intelligence (AI) started to be applied to a lot of spheres of industry, as well as call centers. With cutting-edge AI technologies like speech recognition, call centers can benefit from the automation of processes

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Symmetry	Publication Date: Apr 9, 2021
Citations: 10	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Development of Speech Recognition Systems in Emergency Call Centers

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Symmetry

Lead the way for us

Similar Papers

Unsupervised cross-adaptation using language model and deep learning based acoustic model adaptations
Akira Takagi ... Tetsuo Kosaka
-
Akira Takagi, et. al.Akira Takagi ... Tetsuo Kosaka
01 Dec 2014
01 Dec 2014

A Hybrid Hidden Markov Model for Pipeline Leakage Detection
Mingchi Zhang ... Wei Li
Applied Sciences | VOL. 11
Mingchi Zhang, et. al.Mingchi Zhang ... Wei Li
01 Apr 2021
Applied Sciences | VOL. 11

Phonotactic language recognition based on DNN-HMM acoustic model
Wei-Wei Liu ... Hua Yuan
-
Wei-Wei Liu, et. al.Wei-Wei Liu ... Hua Yuan
01 Sep 2014
01 Sep 2014

Multilingual exemplar-based acoustic model for the NIST Open KWS 2015 evaluation
Do Van Hai ... Haihua Xu
-
Do Van Hai, et. al.Do Van Hai ... Haihua Xu
01 Dec 2015
Multilingual exemplar-based acoustic model for the NIST Open KWS 2015 evaluation
Do Van Hai ... Haihua Xu

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Development of Speech Recognition Systems in Emergency Call Centers

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Symmetry