Automatic sub-word unit discovery and pronunciation lexicon induction for ASR with application to under-resourced languages

Wiehan Agenbag,Thomas Niesler

doi:10.1016/j.csl.2019.02.002

Abstract

We present a method enabling the unsupervised discovery of sub-word units (SWUs) and associated pronunciation lexicons for use in automatic speech recognition (ASR) systems. This includes a novel SWU discovery approach based on self-organising HMM-GMM states that are agglomeratively tied across words as well as a novel pronunciation lexicon induction approach that iteratively reduces pronunciation variation by means of model pruning. Our approach relies only on recorded speech and associated orthographic transcriptions and does not require alphabetic graphemes. We apply our methods to corpora of recorded radio broadcasts in Ugandan English, Luganda and Acholi, of which the latter two are under-resourced. The speech is conversational and contains high levels of background noise, and therefore presents a challenge to automatic lexicon induction. We demonstrate that our proposed method is able to discover lexicons that perform as well as baseline expert systems for Acholi, and close to this level for the other two languages when used to train DNN-HMM ASR systems. This demonstrates the potential of the method to enable and accelerate ASR for under-resourced languages for which a phone inventory and pronunciation lexicon are not available by eliminating the dependence on human expertise this usually requires.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Automatic sub-word unit discovery and pronunciation lexicon induction for ASR with application to under-resourced languages

Abstract

Talk to us

Similar Papers

More From: Computer Speech & Language

Lead the way for us

Journal: Computer Speech & Language	Publication Date: Feb 21, 2019
Citations: 7

Similar Papers

Using Auxiliary Sources of Knowledge for Automatic Speech Recognition

-

01 Jan 2004
01 Jan 2004

A Comparative Study on Selecting Acoustic Modeling Units for WFST-based Mongolian Speech Recognition
Wang Yonghe ... Feilong Bao
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 22
Wang Yonghe, et. al.Wang Yonghe ... Feilong Bao
13 Oct 2023
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 22

Enhancements in automatic Kannada speech recognition system by background noise elimination and alternate acoustic modelling
G Thimmaraja Yadava ... H S Jayanna
International Journal of Speech Technology | VOL. 23
G Thimmaraja Yadava, et. al.G Thimmaraja Yadava ... H S Jayanna
22 Jan 2020
International Journal of Speech Technology | VOL. 23

Writing with automatic speech recognition: Examining user’s behaviours and text quality (lexical diversity)
Walcir Cardoso ... Danial Mehdipour-Kolour
-
Walcir Cardoso, et. al.Walcir Cardoso ... Danial Mehdipour-Kolour
15 Aug 2023
15 Aug 2023

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Automatic sub-word unit discovery and pronunciation lexicon induction for ASR with application to under-resourced languages

Abstract

Talk to us

Similar Papers

More From: Computer Speech &amp; Language

More From: Computer Speech & Language