Abstract
In this paper, we propose a new method for code-switching (CS) automatic speech recognition (ASR) in Korean. First, we account for the phonetic variations that arise when Korean speakers pronounce English words, building a unified pronunciation model based on phonetic knowledge and deep learning. Second, to counteract the bias toward Korean caused by imbalanced training data, we extracted CS sentences semantically similar to the target domain and applied them to language model (LM) adaptation. In our experiments, the training data were AI Hub (1033 h) for Korean and Librispeech (960 h) for English. Compared to the baseline, the proposed method improved the error reduction rate (ERR) by up to 11.6% with phonetic variant modeling, and by 17.3% when semantically similar sentences were applied to LM adaptation. Considering only English words, the word correction rate improved by up to 24.2% over the baseline. These results suggest that the proposed method is highly effective for CS speech recognition.
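The ERR figures above are relative improvements over the baseline. As a minimal sketch (the specific word error rates below are illustrative, not taken from the paper), ERR can be computed from a baseline and a proposed word error rate (WER) as follows:

```python
def error_reduction_rate(baseline_wer: float, proposed_wer: float) -> float:
    """Relative error reduction rate (ERR) between two WERs, in percent.

    ERR = (baseline_wer - proposed_wer) / baseline_wer * 100
    """
    return (baseline_wer - proposed_wer) / baseline_wer * 100.0

# Illustrative values only: a baseline WER of 20.0% reduced to 17.68%
# corresponds to an ERR of 11.6%.
print(round(error_reduction_rate(20.0, 17.68), 1))  # → 11.6
```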
Highlights
Automatic speech recognition (ASR) and speech translation (ST) based on end-to-end (E2E) frameworks have shown significant improvements
In the case of Korean, English words pronounced by Korean speakers—Korean-style English (i.e., Konglish)—exhibit many phonetic variations relative to native-like English pronunciation
To simultaneously address the data imbalance and the scarcity of CS training data, in this paper we propose a hybrid method based on phonetic knowledge and deep learning that integrates Korean and English data
Summary
Automatic speech recognition (ASR) and speech translation (ST) based on end-to-end (E2E) frameworks have shown significant improvements. These systems have been widely adopted in real-life situations such as lectures, business meetings, and human–machine conversations. To assess the effect of CS, we investigated how often Korean sentences contain English words. CS can be categorized into two types: inter-sentential, where language transitions occur at phrase, sentence, or discourse boundaries; and intra-sentential, where transitions occur within a sentence.