Confidence estimation and keyword extraction from speech recognition result based on Web information

Hara Kensuke, Hayamizu Satoru, Kawase Tetsuya, Tamura Satoshi, Sekiya Hideki

doi:10.1109/apsipa.2013.6694114

Abstract

This paper proposes using Web information to compute a confidence measure for speech recognition results and to extract keywords from them. Spoken document processing has been attracting attention, particularly for information retrieval and video (audiovisual) content systems. For example, confidence scores that indicate how likely a document, or a segment of a document, is to contain recognition errors have been studied. Keyword extraction from recognition results is also an important issue. For both purposes, this paper employs pointwise mutual information (PMI) between two words. PMI has previously been used to calculate a confidence measure for speech recognition, serving as a coherence measure based on word co-occurrence. We propose to improve this method with a Web query expansion technique that uses term triplets consisting of nouns from the same document. We also apply PMI to keyword estimation by summing the co-occurrence scores (sumPMI) between a target keyword candidate and each of the other terms. The proposed methods were tested on 10 lectures from the Corpus of Spontaneous Japanese (CSJ) and 2 simulated movie dialogues. The experiments show that the estimated confidence score is strongly related to recognition accuracy, indicating the effectiveness of our method. In subjective tests, keywords also received higher sumPMI scores.
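
As a rough illustration of the two quantities named in the abstract, the sketch below computes PMI from co-occurrence counts and sums it over the other terms of a document to obtain a sumPMI-style score. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names `pmi` and `sum_pmi`, the toy counts, the base-2 logarithm, and the choice to score unseen pairs as 0 are all hypothetical. In the paper the counts come from Web search results, and the query expansion with term triplets is not modeled here.

```python
import math


def pmi(count_x, count_y, count_xy, total):
    """PMI(x, y) = log2( p(x, y) / (p(x) * p(y)) ), estimated from counts.

    `total` is the number of documents (or pages) the counts were taken from.
    """
    if count_x == 0 or count_y == 0 or count_xy == 0:
        # Assumption: treat unseen words or pairs as neutral instead of -inf.
        return 0.0
    return math.log2((count_xy * total) / (count_x * count_y))


def sum_pmi(candidate, terms, counts, pair_counts, total):
    """Sum the PMI between a keyword candidate and every other term."""
    score = 0.0
    for term in terms:
        if term == candidate:
            continue
        pair = frozenset((candidate, term))
        score += pmi(
            counts.get(candidate, 0),
            counts.get(term, 0),
            pair_counts.get(pair, 0),
            total,
        )
    return score


# Toy counts (hypothetical, standing in for Web hit counts).
total = 1000
counts = {"speech": 100, "recognition": 80, "banana": 50}
pair_counts = {
    frozenset(("speech", "recognition")): 40,
    frozenset(("speech", "banana")): 1,
    frozenset(("recognition", "banana")): 1,
}
terms = list(counts)

# Rank the terms by their summed co-occurrence score, highest first.
ranked = sorted(
    terms,
    key=lambda t: sum_pmi(t, terms, counts, pair_counts, total),
    reverse=True,
)
```

With these toy counts, a term that rarely co-occurs with the rest of the document (here "banana", standing in for a likely recognition error) ends up at the bottom of `ranked`, which is the intuition behind using co-occurrence both as a coherence-based confidence cue and as a keyword score.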

