Deep semantic encodings for language modeling

Ali Orkan Bayer,Giuseppe Riccardi

doi:10.21437/interspeech.2015-346

Abstract

Word error rate (WER) is not an appropriate metric for spoken language systems (SLS) because lower WER does not necessarily yield better understanding performance. Therefore, language models (LMs) that are used in SLS should be trained to jointly optimize transcription and understanding performance. Semantic LMs (SELMs) are based on the theory of frame semantics and incorporate features of frames and meaning bearing words (target words) as semantic context when training LMs. The performance of SELMs is affected by the errors on the ASR and the semantic parser output. In this paper we address the problem of coping with such noise in the training phase of the neural network-based architecture of LMs. We propose the use of deep autoencoders for the encoding of semantic context while accounting for ASR errors. We investigate the optimization of SELMs both for transcription and understanding by using deep semantic encodings. Deep semantic encodings suppress the noise introduced by the ASR module, and enable SELMs to be optimized adequately. We assess the understanding performance by measuring the errors made on target words and we achieve 3.7% relative improvement over recurrent neural network LMs.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Deep semantic encodings for language modeling

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Semantic language models for Automatic Speech Recognition
Ali Orkan Bayer ... Giuseppe Riccardi
-
Ali Orkan Bayer, et. al.Ali Orkan Bayer ... Giuseppe Riccardi
01 Dec 2014
01 Dec 2014

Semantic language models with deep neural networks
Ali Orkan Bayer ... Giuseppe Riccardi
Computer Speech & Language | VOL. 40
Ali Orkan Bayer, et. al.Ali Orkan Bayer ... Giuseppe Riccardi
04 May 2016
Computer Speech & Language | VOL. 40

Joint unsupervised adaptation of n-gram and RNN language models via LDA-based hybrid mixture modeling
Ryo Masumura ... Taichi Asami
-
Ryo Masumura, et. al.Ryo Masumura ... Taichi Asami
01 Dec 2017
01 Dec 2017

Training RNN language models on uncertain ASR hypotheses in limited data scenarios
Imran Sheikh ... Irina Illina
Computer Speech & Language | VOL. 83
Imran Sheikh, et. al.Imran Sheikh ... Irina Illina
20 Aug 2023
Computer Speech & Language | VOL. 83

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep semantic encodings for language modeling

Abstract

Talk to us

Similar Papers