Double-layer neighborhood graph based similarity search for fast query-by-example spoken term detection

Kazuo Aoyama,Takashi Hattori,Takaaki Hori,Atsunori Ogawa

doi:10.1109/icassp.2015.7178966

Abstract

This paper presents a novel double-layer neighborhood graph index for acceleration of similarity search that accomplishes fast querybyexample spoken term detection (STD). When a query segment is given, our proposed STD method finds similar segments to the query from an utterance data set by efficient similarity search that traverses the double-layer neighborhood graph (DLG) with a low computational cost. The segment is a sequence of Gaussian mixture model posteriorgram frames and corresponds to a vertex in the DLG. A dissimilarity between vertices is measured by dynamic time warping. The DLG consists of two distinct degree-reduced k-nearest neighbor graphs in a base and an upper layer. The base layer's graph has all the vertices in the data set while the upper layer's graph includes only representatives extracted from the vertices in the base layer. By way of analogy, search in the DLG resembles driving on general roads and express highways appropriately for travel-time saving. Experimental results on the MIT lecture corpus demonstrate that the proposed method achieves CPU time reduction by 40% and more than 60% compared to the most recent method and the ordinary graphbased method, keeping almost the same precision.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Double-layer neighborhood graph based similarity search for fast query-by-example spoken term detection

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Representation Learning for Spoken Term Detection
P Raghavendra Reddy ... B Yegnanarayana
-
P Raghavendra Reddy, et. al.P Raghavendra Reddy ... B Yegnanarayana
07 Dec 2016
07 Dec 2016

Experimental studies on effect of speaking mode on spoken term detection
Kallola Rout ... Pappagari Raghavendra Reddy
-
Kallola Rout, et. al.Kallola Rout ... Pappagari Raghavendra Reddy
01 Feb 2015
01 Feb 2015

Intrinsic spectral analysis based on temporal context features for query-by-example spoken term detection
Peng Yang ... Haizhou Li
-
Peng Yang, et. al.Peng Yang ... Haizhou Li
14 Sep 2014
14 Sep 2014

Re-ranking of spoken term detections using CRF-based triphone detection models
Naoki Sawada ... Satoshi Natori
-
Naoki Sawada, et. al.Naoki Sawada ... Satoshi Natori
01 Dec 2014
01 Dec 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Double-layer neighborhood graph based similarity search for fast query-by-example spoken term detection

Abstract

Talk to us

Similar Papers