Performance of LVCSR with morpheme-based and syllable-based recognition units

Oh-Wook Kwon Oh-Wook Kwon

doi:10.1109/icassp.2000.861974

Abstract

For large vocabulary continuous speech recognition of highly inflected languages, it is the first step to determine an appropriate speech recognition unit to reduce high out-of-vocabulary rate. We investigate two kinds of approaches to select recognition units. In the morpheme-based approach, we use morpheme as basic recognition unit and merge frequent morpheme pairs into phrases by rule-based method or statistical unit merging method. In statistical unit merging, we investigate the effects of part-of-speech constraints used in selecting merging candidates. In the syllable-based approach, assuming that only text data and pronunciation are available, we obtain merged syllables by using the same statistical merging method where pronunciation variation is taken into account. The experimental results showed that the statistical merging method with appropriate linguistic constraints yields best recognition accuracy. Although the syllable-based approach did not show comparable performance, it has the advantage that it does not require a part-of-speech tagging system.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Performance of LVCSR with morpheme-based and syllable-based recognition units

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Korean large vocabulary continuous speech recognition with morpheme-based recognition units
Oh-Wook Kwon ... Jun Park
Speech Communication | VOL. 39
Oh-Wook Kwon, et. al.Oh-Wook Kwon ... Jun Park
04 Mar 2002
Speech Communication | VOL. 39

Tigrinya Automatic Speech recognition with Morpheme based recognition units
Hafte Abera ... Sebsibe Hailemariam
-
Hafte Abera, et. al.Hafte Abera ... Sebsibe Hailemariam
01 Jan 2020
01 Jan 2020

Morpheme concatenation approach in language modeling for large-vocabulary Uyghur speech recognition
Mijit Ablimit ... Tatsuya Kawahara
-
Mijit Ablimit, et. al.Mijit Ablimit ... Tatsuya Kawahara
01 Oct 2011
01 Oct 2011

A usage of the syllable unit based on morphological statistics in Korean large vocabulary continuous speech recognition system
Hyok-Chol Ri
International Journal of Speech Technology | VOL. 22
Hyok-Chol RiHyok-Chol Ri
25 Sep 2019
International Journal of Speech Technology | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Performance of LVCSR with morpheme-based and syllable-based recognition units

Abstract

Talk to us

Similar Papers