Adapting Off-the-Shelf Speech Recognition Systems for Novel Words

Wiam Fadel, Toumi Bouchentouf, Omar Bourja, Pierre-André Buvet

doi:10.3390/info14030179

Abstract

Speech recognition systems with fixed vocabularies have difficulty recognizing out-of-vocabulary (OOV) words such as proper nouns and new words, which leads to misunderstandings or even failures in dialog systems. Reliable speech recognition is crucial for the proper functioning of robot assistants: non-native accents, new vocabulary, and aging voices can all cause recognition errors, and when recognition fails, the assistant robot produces false or random responses. In this paper, we use a statistical approach based on distance algorithms to improve OOV correction. We developed a post-processing algorithm to be combined with a speech recognition model, and we compared two distance algorithms for it: Damerau–Levenshtein and Levenshtein. We validated both algorithms in conjunction with five off-the-shelf speech recognition systems, namely the VOSK API, Google API, Wav2vec2.0, SpeechBrain, and Quartznet pre-trained models. On the MoroccanFrench test set, Damerau–Levenshtein achieved a lower Word Error Rate (WER) than Levenshtein with these five systems. Our post-processing method works regardless of the architecture of the speech recognizer, and with it, results on our MoroccanFrench test set improved on those of the five chosen off-the-shelf recognizers used alone.
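
The abstract does not spell out the post-processing algorithm, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a hypothetical domain lexicon of expected OOV terms and a hypothetical length-normalized threshold (`max_ratio`); the function names are illustrative. The Damerau–Levenshtein function here is the restricted variant (optimal string alignment), which counts a swap of two adjacent characters as a single edit, whereas plain Levenshtein counts it as two.

```python
from typing import Callable, List, Sequence


def levenshtein(a: Sequence, b: Sequence) -> int:
    """Edit distance with insertions, deletions, and substitutions."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, 1):
        cur = [i]
        for j, y in enumerate(b, 1):
            cur.append(min(prev[j] + 1,               # deletion
                           cur[j - 1] + 1,            # insertion
                           prev[j - 1] + (x != y)))   # substitution
        prev = cur
    return prev[-1]


def damerau_levenshtein(a: Sequence, b: Sequence) -> int:
    """Restricted Damerau-Levenshtein (optimal string alignment):
    like Levenshtein, plus adjacent transposition as one edit."""
    rows, cols = len(a) + 1, len(b) + 1
    d = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        d[i][0] = i
    for j in range(cols):
        d[0][j] = j
    for i in range(1, rows):
        for j in range(1, cols):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,
                          d[i][j - 1] + 1,
                          d[i - 1][j - 1] + cost)
            if (i > 1 and j > 1
                    and a[i - 1] == b[j - 2] and a[i - 2] == b[j - 1]):
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)
    return d[-1][-1]


def correct_transcript(words: List[str], lexicon: List[str],
                       distance: Callable[[str, str], int],
                       max_ratio: float = 0.3) -> List[str]:
    """Replace each word by its nearest lexicon term when the distance is
    small relative to word length. Assumes a non-empty lexicon; the
    threshold value is an illustrative assumption."""
    out = []
    for word in words:
        best = min(lexicon, key=lambda t: distance(word.lower(), t.lower()))
        dist = distance(word.lower(), best.lower())
        limit = max_ratio * max(len(word), len(best))
        out.append(best if dist <= limit else word)
    return out


def wer(reference: List[str], hypothesis: List[str]) -> float:
    """Word Error Rate: word-level edit distance over reference length."""
    return levenshtein(reference, hypothesis) / len(reference)


if __name__ == "__main__":
    lexicon = ["Oujda"]                      # hypothetical OOV lexicon
    reference = "je vais à Oujda".split()
    hypothesis = "je vais à oujad".split()   # made-up recognizer output
    for dist in (levenshtein, damerau_levenshtein):
        fixed = correct_transcript(hypothesis, lexicon, dist)
        print(dist.__name__, fixed, wer(reference, fixed))
```

In this toy example the misrecognized word differs from the lexicon entry by one adjacent swap, so it falls within the threshold under Damerau–Levenshtein but not under Levenshtein. Because the correction step only sees the recognizer's text output, a sketch of this kind can sit behind any recognizer, which is consistent with the architecture-independence described in the abstract.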

Journal: Information
Publication Date: Mar 13, 2023
License type: CC BY 4.0

Similar Papers

Combined speech enhancement and auditory modelling for robust distributed speech recognition
Ronan Flynn ... Edward Jones
Speech Communication | VOL. 50
20 May 2008

Error analysis to improve the speech recognition accuracy on Telugu language
N Usha Rani ... P N Girija
Sadhana | VOL. 37
01 Dec 2012

Continuous Speech Recognition Technologies—A Review
Shobha Bhatt ... Amita Dev
20 Sep 2020

Customized deep learning based Turkish automatic speech recognition system supported by language model
Yasin Görmez
PeerJ Computer Science | VOL. 10
03 Apr 2024
