A cross-language study of speech recognition systems for English, German, and Hebrew

Vered Silber Varod, Yamini Sinha, Ingo Siegert, Nitza Geri, Oliver Jokisch

doi:10.36965/ojakm.2021.9(1)1-15

Abstract

Despite the growing importance of Automatic Speech Recognition (ASR), its application is still challenging, limited, and language-dependent, and it requires considerable resources. These resources are not only technical; they also need to reflect technological trends and cultural diversity. The purpose of this research is to explore ASR performance gaps through a comparative study of American English, German, and Hebrew. Apart from different languages, we also investigate different speaking styles: utterances from spontaneous dialogues and utterances from frontal lectures (TED-like genre). The analysis compares the performance of four ASR engines (Google Cloud, Google Search, IBM Watson, and WIT.ai) using four commonly used metrics: Word Error Rate (WER), Character Error Rate (CER), Word Information Lost (WIL), and Match Error Rate (MER). As expected, the findings suggest that English ASR systems provide the best results. Contrary to our hypothesis regarding ASR's low performance for under-resourced languages, we found that the Hebrew and German ASR systems have similar performance. Overall, our findings suggest that ASR performance is language-dependent and system-dependent. Furthermore, ASR may be genre-sensitive, as our results showed for German. This research contributes valuable insight for improving ubiquitous global consumption and management of knowledge, and it calls on commercial companies to exercise corporate social responsibility by developing ASR under Fair, Reasonable, and Non-Discriminatory (FRAND) terms.
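The four metrics named in the abstract can all be derived from one minimum-edit alignment between a reference transcript and an ASR hypothesis. The sketch below is illustrative only: it uses the commonly cited definitions (WER = (S+D+I)/N, MER = (S+D+I)/(H+S+D+I), WIL = 1 - H²/(N·P), and CER as the character-level analogue of WER), and it is an assumption that the study's own tooling follows the same definitions. The function names `align_counts` and `error_metrics` and the sample sentences are hypothetical, and no text normalization (casing, punctuation) is applied.

```python
def align_counts(ref, hyp):
    """Return (hits, substitutions, deletions, insertions) for one
    minimum-edit (Levenshtein) alignment of two token sequences."""
    n, m = len(ref), len(hyp)
    # cost[i][j] = edit distance between ref[:i] and hyp[:j]
    cost = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        cost[i][0] = i
    for j in range(1, m + 1):
        cost[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            diag = cost[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])
            cost[i][j] = min(diag, cost[i - 1][j] + 1, cost[i][j - 1] + 1)

    # Walk back from the bottom-right corner, classifying each step.
    hits = subs = dels = ins = 0
    i, j = n, m
    while i > 0 or j > 0:
        if (i > 0 and j > 0
                and cost[i][j] == cost[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])):
            if ref[i - 1] == hyp[j - 1]:
                hits += 1
            else:
                subs += 1
            i, j = i - 1, j - 1
        elif i > 0 and cost[i][j] == cost[i - 1][j] + 1:
            dels += 1  # reference token missing from the hypothesis
            i -= 1
        else:
            ins += 1   # extra token in the hypothesis
            j -= 1
    return hits, subs, dels, ins


def error_metrics(reference, hypothesis):
    """Compute WER, MER, WIL (word level) and CER (character level)."""
    ref_words, hyp_words = reference.split(), hypothesis.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    h, s, d, i = align_counts(ref_words, hyp_words)
    errors = s + d + i
    n_ref = h + s + d   # number of reference words (N)
    n_hyp = h + s + i   # number of hypothesis words (P)
    wer = errors / n_ref
    mer = errors / (h + errors)
    wil = 1.0 if n_hyp == 0 else 1.0 - (h * h) / (n_ref * n_hyp)
    # CER: same edit-distance ratio, computed over characters (spaces included).
    ch, cs, cd, ci = align_counts(list(reference), list(hypothesis))
    cer = (cs + cd + ci) / (ch + cs + cd)
    return {"WER": wer, "MER": mer, "WIL": wil, "CER": cer}


if __name__ == "__main__":
    # Hypothetical example pair, not taken from the study's data.
    print(error_metrics("the cat sat on the mat", "the cat sit on mat"))
```

WER can exceed 1 when the hypothesis contains many insertions, whereas MER and WIL are bounded between 0 and 1, which may be one reason to report them side by side.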

Journal: Online Journal of Applied Knowledge Management
Publication Date: Jul 26, 2021
Citations: 1
License type: cc-by-nc-nd

Similar Papers

Non-native pronunciation variation modeling using an indirect data driven method
Mina Kim ... Yoo Rhee Oh
01 Jan 2007

Theoretical Analysis of Diversity in an Ensemble of Automatic Speech Recognition Systems
Kartik Audhkhasi ... Shrikanth S Narayanan
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 22
01 Mar 2014

Customized deep learning based Turkish automatic speech recognition system supported by language model.
Yasin Görmez
PeerJ Computer Science | VOL. 10
03 Apr 2024

"Mm-hm," "Uh-uh": are non-lexical conversational sounds deal breakers for the ambient clinical documentation technology?
Brian D Tran ... Jennifer Elston Lafata
Journal of the American Medical Informatics Association | VOL. 30
23 Jan 2023
