Use of articulatory bottle-neck features for query-by-example spoken term detection in low resource scenarios

Gautam Mantena, Kishore Prahallad

doi:10.1109/icassp.2014.6854983

Abstract

For query-by-example spoken term detection (QbE-STD), generating phone posteriorgrams requires labelled data, which is difficult to obtain for low-resource languages. One solution is to build models from resource-rich languages and use them in the low-resource scenario. However, phone classes are not language-universal, so alternative representations such as articulatory classes are explored. In this paper, we use articulatory information and its derivatives, such as bottle-neck (BN) features (referred to here as articulatory BN features), for QbE-STD. We obtain Gaussian posteriorgrams of articulatory BN features in tandem with acoustic parameters, such as frequency domain linear prediction cepstral coefficients, to perform the search. We compare the search performance of articulatory and phone BN features and show that articulatory BN features are a better representation. We also provide experimental results showing that a small amount of training data (30 minutes) can be used to derive articulatory BN features.
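
To make the phrase "Gaussian posteriorgrams of articulatory BN features in tandem with the acoustic parameters" concrete, the following is a minimal sketch, not the authors' implementation. It assumes that "tandem" means frame-wise concatenation of the two feature streams and that the posteriorgram is the vector of component posteriors under a diagonal-covariance Gaussian mixture model. The function names (`tandem`, `gaussian_posteriorgram`) and all numeric values are hypothetical toy choices; the abstract does not specify feature dimensions, the number of Gaussians, or how the mixture is trained.

```python
import math


def tandem(bn_frames, acoustic_frames):
    """Concatenate BN and acoustic feature vectors frame by frame (assumed meaning of 'tandem')."""
    return [list(b) + list(a) for b, a in zip(bn_frames, acoustic_frames)]


def log_gaussian_diag(x, mean, var):
    """Log density of x under a Gaussian with diagonal covariance."""
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )


def gaussian_posteriorgram(frames, weights, means, variances):
    """Return one posterior vector per frame: P(component k | frame)."""
    posteriorgram = []
    for x in frames:
        log_joint = [
            math.log(w) + log_gaussian_diag(x, m, v)
            for w, m, v in zip(weights, means, variances)
        ]
        # Subtract the maximum before exponentiating for numerical stability.
        peak = max(log_joint)
        unnorm = [math.exp(lj - peak) for lj in log_joint]
        total = sum(unnorm)
        posteriorgram.append([u / total for u in unnorm])
    return posteriorgram


# Toy data: two frames, 2-dim "BN" features and 1-dim "acoustic" features.
bn_frames = [[0.1, -0.2], [1.0, 0.9]]
acoustic_frames = [[0.0], [1.1]]

# Hypothetical two-component mixture over the 3-dim tandem features.
weights = [0.5, 0.5]
means = [[0.0, 0.0, 0.0], [1.0, 1.0, 1.0]]
variances = [[1.0, 1.0, 1.0], [1.0, 1.0, 1.0]]

post = gaussian_posteriorgram(tandem(bn_frames, acoustic_frames), weights, means, variances)
```

Each row of `post` sums to one. In a QbE-STD system, such rows would be computed for both the spoken query and the search audio and then compared by a sequence-matching step, which the abstract does not describe and which is therefore not sketched here.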

Similar Papers

Multilingual Bottleneck Features for Query by Example Spoken Term Detection
Dhananjay Ram ... Lesly Miculicich
01 Dec 2019

Query-by-Example Spoken Term Detection using low dimensional posteriorgrams motivated by articulatory classes
Abhimanyu Popli ... Arun Kumar
Control theory & applications | VOL. 18
01 Oct 2015

Neural Network Based End-to-End Query by Example Spoken Term Detection
Dhananjay Ram ... Herve Bourlard
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 28
01 Jan 2020

CNN-based bottleneck feature for noise robust query-by-example spoken term detection
Hyungjun Lim ... Yoonhoe Kim
01 Dec 2017
