N-gram adaptation using Dirichlet class language model based on part-of-speech for speech recognition

Ali Hatami, Ahmad Akbari, Babak Nasersharif

doi:10.1109/iraniancee.2013.6599642

Abstract

The language model plays an important role in automatic speech recognition (ASR) systems. Its performance depends on how well it is adapted to the linguistic features of the language. Accordingly, adaptation methods try to exploit syntactic and semantic characteristics of the language in language modeling. Previous adaptation methods, such as the Dirichlet class language model (DCLM) family, extract the class of the history words. Because they lack syntactic information, these methods are not well suited to morphologically rich languages such as Farsi. This work proposes using syntactic information, namely part-of-speech (POS), in DCLM and combining the result with an n-gram language model. In the proposed approach, word clustering is based on the POS of the previous words and on the history words. The performance of the language models is evaluated on the BijanKhan corpus using a hidden Markov model based ASR system. Our experiments show that using POS information along with the history words and the class of the history words improves the language model and decreases perplexity on our corpus. With POS information added to DCLM, the word error rate of the ASR system decreases by 1% in comparison to DCLM.
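The abstract reports perplexity for a class-based model combined with an n-gram model but does not say how the two are combined. As a minimal sketch, assuming simple linear interpolation with a hypothetical weight `lam`, and using made-up per-word probabilities rather than any values from the paper, the combination and the perplexity measure could look like this:

```python
import math

def interpolate(p_ngram, p_class, lam):
    """Linearly interpolate two probability estimates for the same word.

    lam is a hypothetical mixing weight in [0, 1]; the abstract does not
    state how the models are actually combined.
    """
    return lam * p_ngram + (1.0 - lam) * p_class

def perplexity(word_probs):
    """Perplexity: exp of the average negative log-probability per word."""
    log_sum = sum(math.log(p) for p in word_probs)
    return math.exp(-log_sum / len(word_probs))

# Toy per-word probabilities for a 4-word test sentence (illustrative only).
ngram_probs = [0.10, 0.02, 0.05, 0.20]   # from a plain n-gram model
class_probs = [0.08, 0.06, 0.10, 0.15]   # from a class-based model

mixed_probs = [
    interpolate(pn, pc, lam=0.5)
    for pn, pc in zip(ngram_probs, class_probs)
]

ppl_ngram = perplexity(ngram_probs)
ppl_mixed = perplexity(mixed_probs)
```

Lower perplexity means the model assigns higher probability to the test words. In this toy data the interpolated model scores lower than the n-gram model alone, but that outcome depends entirely on the chosen numbers.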

Similar Papers

Factored language model adaptation using Dirichlet class language model for speech recognition
Ali Hatami ... Babak Nasersharif
01 May 2013

Language Model Adaptation Using Dirichlet Class Language Model Based on Part-of-Speech
...
21 Mar 2014

Exploring recurrent neural network based acoustic and linguistic modeling for children's speech recognition
Sreeram Ganji ... Rohit Sinha
01 Nov 2017

Joint unsupervised adaptation of n-gram and RNN language models via LDA-based hybrid mixture modeling
Ryo Masumura ... Hirokazu Masataki
01 Dec 2017
