Intrinsic spectral analysis based on temporal context features for query-by-example spoken term detection

Peng Yang,Lei Xie,Bin Ma,Cheung-Chi Leung,Haizhou Li

doi:10.21437/interspeech.2014-394

Abstract

We investigate the use of intrinsic spectral analysis (ISA) for query-by-example spoken term detection (QbE-STD). In the task, spoken queries and test utterances in an audio archive are converted to ISA features, and dynamic time warping is applied to match the feature sequence in each query with those in test utterances. Motivated by manifold learning, ISA has been proposed to recover from untranscribed utterances a set of nonlinear basis functions for the speech manifold, and shown with improved phonetic separability and inherent speaker independence. Due to the coarticulation phenomenon in speech, we propose to use temporal context information to obtain the ISA features. Gaussian posteriorgram, as an efficient acoustic representation usually used in QbE-STD, is considered a baseline feature. Experimental results on the TIMIT speech corpus show that the ISA features can provide a relative 13.5% improvement in mean average precision over the baseline features, when the temporal context information is used. Index Terms: spoken term detection, intrinsic spectral analysis, Gaussian posteriorgram, dynamic time warping

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Intrinsic spectral analysis based on temporal context features for query-by-example spoken term detection

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Partial matching and search space reduction for QbE-STD
Maulik C Madhavi ... Hemant A Patil
Computer Speech & Language | VOL. 45
Maulik C Madhavi, et. al.Maulik C Madhavi ... Hemant A Patil
28 Mar 2017
Computer Speech & Language | VOL. 45

Combining evidences from detection sources for query-by-example spoken term detection
Maulik C Madhavi ... Hemant A Patil
-
Maulik C Madhavi, et. al.Maulik C Madhavi ... Hemant A Patil
01 Dec 2017
01 Dec 2017

Analysis of constraints on segmental DTW for the task of query-by-example spoken term detection
Sri Harsha Dumpala ... Anil Kumar Vuppala
-
Sri Harsha Dumpala, et. al.Sri Harsha Dumpala ... Anil Kumar Vuppala
01 Dec 2015
01 Dec 2015

Speed improvements to Information Retrieval-based dynamic time warping using hierarchical K-Means clustering
Gautam Mantena ... Xavier Anguera
-
Gautam Mantena, et. al.Gautam Mantena ... Xavier Anguera
01 May 2013
01 May 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Intrinsic spectral analysis based on temporal context features for query-by-example spoken term detection

Abstract

Talk to us

Similar Papers