Subspace Gaussian mixture based language modeling for large vocabulary continuous speech recognition

Ri Hyon Sun, Ri Jong Chol

doi:10.1016/j.specom.2020.01.001

Abstract

This paper focuses on an adaptable continuous-space language modeling approach that combines the longer context information of a recurrent neural network (RNN) with the adaptation ability of the subspace Gaussian mixture model (SGMM), which has been widely used in acoustic modeling for automatic speech recognition (ASR). In large vocabulary continuous speech recognition (LVCSR), it is a challenging problem to construct language models that capture the longer context information of words while ensuring generalization and adaptation ability. Recently, language modeling based on RNNs and their variants has been broadly studied in this field. The goal of our approach is to obtain history feature vectors of a word with longer context information and to model every word by a subspace Gaussian mixture model, as in the Tandem system used in acoustic modeling for ASR. A further goal is to apply the fMLLR adaptation method, which is widely used in SGMM-based acoustic modeling, to the adaptation of the subspace Gaussian mixture based language model (SGMLM). After fMLLR adaptation, SGMLMs based on Top-Down and Bottom-Up obtain WERs of 5.70% and 6.01%, which are better than 4.15% and 4.61% of that without adaptation, respectively. Also, with fMLLR adaptation, the Top-Down and Bottom-Up based SGMLMs yield absolute word error rate reductions of 1.48% and 1.02% and relative perplexity reductions of 10.02% and 6.46%, respectively, compared to the RNNLM without adaptation.
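The abstract does not give the model equations, so the following is only a minimal sketch of the general idea under stated assumptions: each word is represented by a Gaussian mixture over history feature vectors, a word posterior is obtained by Bayes' rule, and adaptation is an fMLLR-style affine transform of the feature vector. The toy vocabulary, the two-dimensional features, the diagonal covariances, the priors, and the transform (A, b) are all hypothetical, and the subspace parameterization of the SGMM and the RNN that would produce the history vectors are omitted.

```python
import math

def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )

def log_gmm(x, components):
    """Log density of a mixture; components = [(weight, mean, var), ...]."""
    logs = [math.log(w) + log_gauss_diag(x, m, v) for w, m, v in components]
    top = max(logs)  # log-sum-exp for numerical stability
    return top + math.log(sum(math.exp(l - top) for l in logs))

def affine_transform(x, A, b):
    """fMLLR-style feature transform x' = A x + b.

    A and b are assumed to be already estimated; the estimation itself is
    not shown. The log|det A| term of fMLLR is the same for every word, so
    it cancels in the normalized posterior below and is left out.
    """
    return [sum(a * xi for a, xi in zip(row, x)) + bi for row, bi in zip(A, b)]

def word_posteriors(h, word_gmms, priors):
    """P(w | h) proportional to P(w) * p(h | w), normalized over the vocabulary."""
    scores = {w: math.log(priors[w]) + log_gmm(h, g) for w, g in word_gmms.items()}
    top = max(scores.values())
    norm = top + math.log(sum(math.exp(s - top) for s in scores.values()))
    return {w: math.exp(s - norm) for w, s in scores.items()}

# Hypothetical toy model: two words, 2-dimensional history features.
word_gmms = {
    "cat": [(1.0, [0.0, 0.0], [1.0, 1.0])],
    "dog": [(0.5, [3.0, 3.0], [1.0, 1.0]), (0.5, [4.0, 2.0], [1.0, 1.0])],
}
priors = {"cat": 0.5, "dog": 0.5}

h = [0.2, -0.1]                      # stand-in for an RNN history vector
post = word_posteriors(h, word_gmms, priors)

A = [[1.0, 0.0], [0.0, 1.0]]         # hypothetical adaptation transform
b = [3.0, 3.0]
post_adapted = word_posteriors(affine_transform(h, A, b), word_gmms, priors)
```

In this toy setup the unadapted vector lies near the "cat" Gaussian, while the transformed vector lies near a "dog" component, which illustrates how a feature-space transform can shift word posteriors without changing the per-word mixture parameters.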

Journal: Speech Communication	Publication Date: Jan 23, 2020
Citations: 7

Similar Papers

Automatic Speech Recognition Based on Neural Networks
Ralf Schlüter ... Pavel Golik
01 Jan 2015

Using different acoustic, lexical and language modeling units for ASR of an under-resourced language – Amharic
Martha Yifiru Tachbelie ... Laurent Besacier
Speech Communication | VOL. 56
14 Feb 2013

A back-off discriminative acoustic model for automatic speech recognition
Hung-An Chang ... James R Glass
06 Sep 2009

Semantic language models for Automatic Speech Recognition
Ali Orkan Bayer ... Giuseppe Riccardi
01 Dec 2014
