Unsupervised Discovery of Structured Acoustic Tokens With Applications to Spoken Term Detection

Cheng-Tao Chung, Lin-Shan Lee

doi:10.1109/taslp.2017.2778948

Abstract

In this paper, we compare two paradigms for unsupervised discovery of structured acoustic tokens directly from speech corpora without any human annotation. The multigranular paradigm seeks to capture all available information in the corpora with multiple sets of tokens for different model granularities. The hierarchical paradigm attempts to jointly learn several levels of signal representations in a hierarchical structure. The two paradigms are unified within a theoretical framework in this paper. Query-by-example spoken term detection (QbE-STD) experiments on the Query by Example Search on Speech Task (QUESST) dataset of MediaEval 2015 verify the competitiveness of the acoustic tokens. The enhanced relevance score proposed in this work improves both paradigms for the task of QbE-STD. We also list results on the ABX evaluation task of the Zero Resource Challenge 2015 for comparison of the paradigms.

Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing
Publication Date: Feb 1, 2018
Citations: 47

Similar Papers

Spoken Term Detection Techniques
Leena Mary ... Deekshitha G
26 Sep 2018

Intrinsic spectral analysis based on temporal context features for query-by-example spoken term detection
Peng Yang ... Haizhou Li
14 Sep 2014

An iterative deep learning framework for unsupervised discovery of speech features and linguistic units with applications on spoken term detection
Cheng-Tao Chung ... Lin-Shan Lee
01 Dec 2015

Query-by-Example Spoken Term Detection using low dimensional posteriorgrams motivated by articulatory classes
Abhimanyu Popli ... Arun Kumar
Control theory & applications | VOL. 18
01 Oct 2015
