Speed improvements to Information Retrieval-based dynamic time warping using hierarchical K-Means clustering

Gautam Mantena,Xavier Anguera

doi:10.1109/icassp.2013.6639327

Abstract

With the increase in multi-media data over the Internet, query by example spoken term detection (QbE-STD) has become important in providing a search mechanism to find spoken queries in spoken audio. Audio search algorithms should be efficient in terms of speed and memory to handle large audio files. In general, approaches derived from the well known dynamic time warping (DTW) algorithm suffer from scalability problems. To overcome such problems, an Information Retrieval-based DTW (IR-DTW) algorithm has been proposed recently. IR-DTW borrows techniques from Information Retrieval community to detect regions which are more likely to contain the spoken query and then uses a standard DTW to obtain exact start and end times. One drawback of the IR-DTW is the time taken for the retrieval of similar reference points for a given query point. In this paper we propose a method to improve the search performance of IR-DTW algorithm using a clustering based technique. The proposed method has shown an estimated speedup of 2400X.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Speed improvements to Information Retrieval-based dynamic time warping using hierarchical K-Means clustering

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Partial matching and search space reduction for QbE-STD
Maulik C Madhavi ... Hemant A Patil
Computer Speech & Language | VOL. 45
Maulik C Madhavi, et. al.Maulik C Madhavi ... Hemant A Patil
28 Mar 2017
Computer Speech & Language | VOL. 45

Query-by-example spoken term detection using bottleneck feature and Hidden Markov model
Xue Liu ... Niansong Wang
-
Xue Liu, et. al. Xue Liu ... Niansong Wang
01 Aug 2015
01 Aug 2015

A fast query-by-example spoken term detection for zero resource languages
Pandia D.S Karthik ... Hema A Murthy
-
Pandia D.S Karthik, et. al.Pandia D.S Karthik ... Hema A Murthy
01 Jun 2016
01 Jun 2016

Intrinsic spectral analysis based on temporal context features for query-by-example spoken term detection
Peng Yang ... Haizhou Li
-
Peng Yang, et. al.Peng Yang ... Haizhou Li
14 Sep 2014
14 Sep 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Speed improvements to Information Retrieval-based dynamic time warping using hierarchical K-Means clustering

Abstract

Talk to us

Similar Papers