Abstract

Query-by-example search often uses dynamic time warping (DTW) for comparing queries and proposed matching segments. Recent work has shown that comparing speech segments by representing them as fixed-dimensional vectors --- acoustic word embeddings --- and measuring their vector distance (e.g., cosine distance) can discriminate between words more accurately than DTW-based approaches. We consider an approach to query-by-example search that embeds both the query and database segments according to a neural model, followed by nearest-neighbor search to find the matching segments. Earlier work on embedding-based query-by-example, using template-based acoustic word embeddings, achieved competitive performance. We find that our embeddings, based on recurrent neural networks trained to optimize word discrimination, achieve substantial improvements in performance and run-time efficiency over the previous approaches.
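As an illustration only (not the authors' released code), the following sketch shows the general embedding-based query-by-example recipe the abstract describes: embed the query and each database segment as fixed-dimensional vectors, then rank database segments by cosine distance to the query. The function `embed_segment` is a hypothetical placeholder for a trained acoustic word embedding model (e.g., an RNN trained for word discrimination); all names and shapes below are assumptions for demonstration.

```python
import numpy as np

def embed_segment(features: np.ndarray) -> np.ndarray:
    """Placeholder: map a variable-length feature sequence (T x D) to a
    fixed-dimensional embedding. A real system would run a trained neural
    network here; mean-pooling is only a stand-in for illustration."""
    return features.mean(axis=0)

def cosine_distance(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine distance between two embedding vectors."""
    return 1.0 - float(np.dot(a, b) /
                       (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12))

def search(query_feats: np.ndarray, database_feats: list, top_k: int = 5):
    """Embed the query and all database segments, then return the indices
    of the top_k nearest segments under cosine distance."""
    q = embed_segment(query_feats)
    dists = np.array([cosine_distance(q, embed_segment(x))
                      for x in database_feats])
    return np.argsort(dists)[:top_k], dists

# Example usage with synthetic "acoustic features" (frames x feature dims).
rng = np.random.default_rng(0)
query = rng.normal(size=(80, 39))  # e.g., 80 frames of 39-dim features
database = [rng.normal(size=(int(rng.integers(50, 120)), 39))
            for _ in range(1000)]
top_indices, distances = search(query, database)
print(top_indices, distances[top_indices])
```

In a deployed system, the brute-force scan over database embeddings would typically be replaced by an approximate nearest-neighbor index, which is what makes the embedding-based approach attractive for run-time efficiency compared with segment-by-segment DTW.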
