Cross-Lingual Phone Mapping for Large Vocabulary Speech Recognition of Under-Resourced Languages

Van Hai Do,Eng Siong Chng,Haizhou Li,Xiong Xiao

doi:10.1587/transinf.e97.d.285

Abstract

SUMMARY This paper presents a novel acoustic modeling technique of large vocabulary automatic speech recognition for under-resourced languages by leveraging well-trained acoustic models of other languages (called source languages). The idea is to use source language acoustic model to score the acoustic features of the target language, and then map these scores to the posteriors of the target phones using a classifier. The target phone posteriors are then used for decoding in the usual way of hybrid acoustic modeling. The motivation of such a strategy is that human languages usually share similar phone sets and hence it may be easier to predict the target phone posteriors from the scores generated by source language acoustic models than to train from scratch an under-resourced language acoustic model. The proposed method is evaluated using on the Aurora-4 task with less than 1 hour of training data. Two types of source language acoustic models are considered, i.e. hybrid HMM/MLP and conventional HMM/GMM models. In addition, we also use triphone tied states in the mapping. Our experimental results show that by leveraging well trained Malay and Hungarian acoustic models, we achieved 9.0% word error rate (WER) given 55 minutes of English training data. This is close to the WER of 7.9% obtained by using the full 15 hours of training data and much better than the WER of 14.4% obtained by conventional acoustic modeling techniques with the same 55 minutes of training data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEICE Transactions on Information and Systems	Publication Date: Jan 1, 2014
Citations: 25	License type: free

R Discovery Prime

R Discovery Prime

Cross-Lingual Phone Mapping for Large Vocabulary Speech Recognition of Under-Resourced Languages

Abstract

Talk to us

Similar Papers

More From: IEICE Transactions on Information and Systems

Lead the way for us

Similar Papers

An Investigation of Multilingual TDNN-BLSTM Acoustic Modeling for Hindi Speech Recognition
Ankit Kumar ... Rajesh Kumar Aggarwal
International Journal of Sensors, Wireless Communications and Control | VOL. 12
Ankit Kumar, et. al.Ankit Kumar ... Rajesh Kumar Aggarwal
01 Jan 2021
International Journal of Sensors, Wireless Communications and Control | VOL. 12

Croatian Large Vocabulary Automatic Speech Recognition
...
Automatika | VOL. 52
, et. al. ...
18 Jan 2017
Automatika | VOL. 52

Croatian Large Vocabulary Automatic Speech Recognition
Sandaasst Prof Martinčić-Ipšić ... Ivoprof Ipšić
Automatika | VOL. 52
Sandaasst Prof Martinčić-Ipšić, et. al.Sandaasst Prof Martinčić-Ipšić ... Ivoprof Ipšić
01 Jan 2010
Automatika | VOL. 52

End-to-end automated speech recognition using a character based small scale transformer architecture
Alexander Loubser ... Allan De Freitas
Expert Systems With Applications | VOL. 252
Alexander Loubser, et. al.Alexander Loubser ... Allan De Freitas
01 May 2024
Expert Systems With Applications | VOL. 252

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Cross-Lingual Phone Mapping for Large Vocabulary Speech Recognition of Under-Resourced Languages

Abstract

Talk to us

Similar Papers

More From: IEICE Transactions on Information and Systems