Abstract

Language models (LMs) are an important component of automatic speech recognition (ASR) systems: the LM helps the acoustic model find the word sequence corresponding to a given speech signal. Without one, an ASR system would have no knowledge of the language and would struggle to find the correct word sequence. Over the past few years, researchers have tried to incorporate long-range dependencies into statistical word-based n-gram LMs. One such long-range dependency is topic. Unlike words, topic is unobservable, so the meanings behind the words must be uncovered to reach the topic. This research is based on the belief that nouns carry topic information. We propose a new approach to a topic-dependent LM in which the topic is determined in an unsupervised manner. Latent Semantic Analysis (LSA) is employed to reveal hidden (latent) relations among the nouns in the context words. To decide the topic of an event, a fixed-size window of word history is observed, and a vote is then taken based on noun-class occurrences weighted by a confidence measure. Experiments were conducted on an English corpus and a Japanese corpus: the Wall Street Journal corpus and the Mainichi Shimbun (Japanese newspaper) corpus. The results show that the proposed method achieves better perplexity than the comparative baselines, including word-based and class-based n-gram LMs, their interpolated LM, a cache-based LM, a topic-dependent LM based on n-grams, and a topic-dependent LM based on Latent Dirichlet Allocation (LDA). N-best list rescoring was conducted to validate the method's applicability in ASR systems.
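To make the LSA projection and the windowed voting step concrete, the sketch below shows one plausible reading of the abstract; it is a minimal illustration, not the paper's implementation. The dimensionality k, the noun_class mapping, and the per-noun confidence weights are assumed placeholders, since the abstract does not specify these details.

# Minimal sketch of the topic-voting idea described above.
# Assumptions (not from the paper): noun_class maps each noun to a
# latent noun class obtained by clustering LSA vectors, and confidence
# holds a per-noun weight for its vote.
import numpy as np

def lsa_embed(noun_doc_counts, k=50):
    """Project a noun-by-document count matrix into a k-dimensional
    latent space via truncated SVD (the core of LSA)."""
    U, S, Vt = np.linalg.svd(noun_doc_counts, full_matrices=False)
    return U[:, :k] * S[:k]  # one latent vector per noun

def vote_topic(window_nouns, noun_class, confidence, n_topics):
    """Decide the topic of the current event by weighted voting over
    the nouns in a fixed-size history window."""
    scores = np.zeros(n_topics)
    for noun in window_nouns:
        c = noun_class.get(noun)  # latent class of this noun, if known
        if c is not None:
            scores[c] += confidence.get(noun, 1.0)  # weight vote
    return int(np.argmax(scores))  # topic with the most weighted votes

Here unknown nouns default to a weight of 1.0; the paper's actual confidence measure would replace that default, and the winning topic would then select the topic-dependent LM used to predict the next word.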
