Spoken keyword search system using improved ASR engine and novel template-based keyword scoring

Ilyes Rebai, Yassine Ben Ayed, Walid Mahdi

doi:10.1007/s11042-018-6276-y

Abstract

Keyword search for spoken documents has become increasingly important as the amount of spoken data grows. A typical system combines an Automatic Speech Recognition (ASR) system with information retrieval methods. Although many studies have sought to optimize system performance, keyword search (KWS) systems still suffer from two main drawbacks. First, system performance depends strongly on the ASR transcripts, which are inherently inexact: because of the variability of the speech signal, ASR systems remain far from robust. Second, KWS systems make detection decisions based on lattice-based posterior probabilities, which are not comparable across keywords. In addition, the posterior probabilities of true detections usually fall into different ranges, which decreases spotting performance. This paper addresses both problems: the accuracy of ASR transcriptions and keyword detection decisions based on posterior probabilities. More specifically, we propose to improve the accuracy of ASR transcripts by introducing a new ASR architecture that integrates data augmentation and ensemble learning techniques into a single framework. In addition, we propose a novel keyword rescoring method that provides scores from a new perspective. Inspired by the template-based KWS approach, similarity scores between the detected keywords are obtained by computing the distance between their acoustic features, and these are used as new scores for the detection decision. Experiments on French and English datasets show that the proposed KWS system potentially leads to more accurate keyword results than conventional systems.
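The rescoring idea in the abstract can be illustrated with a minimal sketch. The abstract states only that a distance between the acoustic features of detected keywords is used as the new score; it does not name the distance or how distances are aggregated. The sketch below therefore assumes dynamic time warping (DTW) over frame-level feature vectors and a mean over the other detections of the same keyword. The function names `dtw_distance` and `rescore` are hypothetical and do not come from the paper.

```python
import numpy as np


def dtw_distance(a, b):
    """Length-normalized DTW distance between two feature sequences.

    a, b: arrays of shape (frames, dims), e.g. MFCC frames (an assumption).
    """
    n, m = len(a), len(b)
    # Euclidean distance between every pair of frames.
    cost = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=2)
    acc = np.full((n + 1, m + 1), np.inf)
    acc[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            # Extend the cheapest of the three allowed predecessor paths.
            acc[i, j] = cost[i - 1, j - 1] + min(
                acc[i - 1, j], acc[i, j - 1], acc[i - 1, j - 1]
            )
    return acc[n, m] / (n + m)


def rescore(detections):
    """Score each detection of one keyword by its similarity to the others.

    Assumed aggregation: negative mean DTW distance, so a higher score
    means the detection is acoustically closer to its peers.
    """
    scores = []
    for i, feat in enumerate(detections):
        dists = [dtw_distance(feat, d) for j, d in enumerate(detections) if j != i]
        scores.append(-float(np.mean(dists)) if dists else 0.0)
    return scores
```

Under these assumptions, a detection whose features lie far from the other detections of the same keyword receives a lower score, so a single threshold on the new scores can separate likely false alarms from consistent hits.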

Journal: Multimedia Tools and Applications
Publication Date: Jun 25, 2018
Citations: 2

Similar Papers

End-to-End ASR-Free Keyword Search From Speech
Kartik Audhkhasi ... Brian Kingsbury
IEEE Journal of Selected Topics in Signal Processing | VOL. 11
13 Jan 2017

Toward scalable keyword search over relational data
Akanksha Baid ... Jeffrey Naughton
Proceedings of the VLDB Endowment | VOL. 3
01 Sep 2010

A Keyword-Aware Language Modeling Approach to Spoken Keyword Search
I-Fan Chen ... Boon Pang Lim
Journal of Signal Processing Systems | VOL. 82
21 Apr 2015

Toward industrial-strength keyword search systems over relational data
Akanksha Baid ... Jeffrey F Naughton
01 Jan 2009
