Tigrinya Automatic Speech recognition with Morpheme based recognition units

Hafte Abera,Sebsibe Hailemariam

doi:10.18653/v1/2020.winlp-1.12

Abstract

The Tigrinya language is agglutinative and has a large number of inflected and derived forms of words. Therefore a Tigrinya large vocabulary continuous speech recognition system often has a large number of different units and a high out-of-vocabulary (OOV) rate if a word is used as a recognition unit of a language model (LM) and lexicon. Therefore a morpheme-based approach has often been used and a morpheme is used as the recognition unit to reduce the high OOV rate. This paper presents an automatic speech recognition experiment conducted to see the effect of OOV words on the performance speech recognition system for Tigrinya. We tried to solve the OOV problem by using morphemes as lexicon and language model units. It has been found that the morpheme-based recognition system is better lexical and language modeling units than words. An absolute improvement (in word recognition accuracy) of 3.45 token and 8.36 types has been obtained as a result of using a morph-based vocabulary.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Tigrinya Automatic Speech recognition with Morpheme based recognition units

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Using different acoustic, lexical and language modeling units for ASR of an under-resourced language – Amharic
Martha Yifiru Tachbelie ... Laurent Besacier
Speech Communication | VOL. 56
Martha Yifiru Tachbelie, et. al.Martha Yifiru Tachbelie ... Laurent Besacier
14 Feb 2013
Speech Communication | VOL. 56

Lexical units for Thai LVCSR
Markpong Jongtaveesataporn ... Sadaoki Furui
Speech Communication | VOL. 51
Markpong Jongtaveesataporn, et. al.Markpong Jongtaveesataporn ... Sadaoki Furui
11 Dec 2008
Speech Communication | VOL. 51

Discrete-Mixture HMMs-based Approach for Noisy Speech Recognition
Tetsuo Kosaka ... Masaki Koh
-
Tetsuo Kosaka, et. al.Tetsuo Kosaka ... Masaki Koh
01 Jun 2007
01 Jun 2007

Automatic diagnosis of recognition errors in large vocabulary continuous speech recognition systems
Hiroaki Nanjo ... Tatsuya Kawahara
-
Hiroaki Nanjo, et. al.Hiroaki Nanjo ... Tatsuya Kawahara
16 Oct 2000
16 Oct 2000

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Tigrinya Automatic Speech recognition with Morpheme based recognition units

Abstract

Talk to us

Similar Papers