High-performance Query-by-Example Spoken Term Detection on the SWS 2013 evaluation

Luis J Rodriguez-Fuentes,Mikel Penagarikano,German Bordel,Mireia Diez,Amparo Varona

doi:10.1109/icassp.2014.6855122

Luis J Rodriguez-Fuentes, Mikel Penagarikano + Show 3 more

Open Access

https://doi.org/10.1109/icassp.2014.6855122

Copy DOI

Abstract

In the last years, the task of Query-by-Example Spoken Term Detection (QbE-STD), which aims to find occurrences of a spoken query in a set of audio documents, has gained the interest of the research community for its versatility in settings where untranscribed, multilingual and acoustically unconstrained spoken resources, or spoken resources in low-resource languages, must be searched. This paper describes and reports experimental results for a QbE-STD system that achieved the best performance in the recent Spoken Web Search (SWS) evaluation, held as part of MediaEval 2013. Though not optimized for speed, the system operates faster than real-time. The system exploits high-performance phone decoders to extract frame-level phone posteriors (a common representation in QbE-STD tasks). Then, given a query and a audio document, a distance matrix is computed between their phone posterior representations, followed by a newly introduced distance normalization technique and an iterative Dynamic Time Warping (DTW) matching procedure with some heuristic prunings. Results show that remarkable performance improvements can be achieved by using multiple examples per query and, specially, through the late (score-level) fusion of different subsystems, each based on a different set of phone posteriors.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

High-performance Query-by-Example Spoken Term Detection on the SWS 2013 evaluation

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Combining evidences from detection sources for query-by-example spoken term detection
Maulik C Madhavi ... Hemant A Patil
-
Maulik C Madhavi, et. al.Maulik C Madhavi ... Hemant A Patil
01 Dec 2017
01 Dec 2017

Sparse Subspace Modeling for Query by Example Spoken Term Detection
Dhananjay Ram ... Herve Bourlard
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 26
Dhananjay Ram, et. al.Dhananjay Ram ... Herve Bourlard
01 Jun 2018
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 26

Intrinsic spectral analysis based on temporal context features for query-by-example spoken term detection
Peng Yang ... Haizhou Li
-
Peng Yang, et. al.Peng Yang ... Haizhou Li
14 Sep 2014
14 Sep 2014

Partial matching and search space reduction for QbE-STD
Maulik C Madhavi ... Hemant A Patil
Computer Speech & Language | VOL. 45
Maulik C Madhavi, et. al.Maulik C Madhavi ... Hemant A Patil
28 Mar 2017
Computer Speech & Language | VOL. 45

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

High-performance Query-by-Example Spoken Term Detection on the SWS 2013 evaluation

Abstract

Talk to us

Similar Papers