Improved spoken term detection using support vector machines with acoustic and context features from pseudo-relevance feedback

Tsung-Wei Tu,Lin-Shan Lee,Hung-Yi Lee

doi:10.1109/asru.2011.6163962

Abstract

This paper reports a new approach to improving spoken term detection that uses support vector machine (SVM) with acoustic and linguistic features. As SVM is a good technique for discriminating different features in vector space, we recently proposed to use pseudo-relevance feedback to automatically generate training data for SVM training and use SVM to re-rank the first-pass results considering the context consistency in the lattices. In this paper, we further extend this concept by considering acoustic features at word, phone and HMM state levels and linguistic features of different order. Extensive experiments under various recognition environments demonstrate significant improvements in all cases. In particular, the acoustic features at the HMM state level offered the most significant improvements, and the improvements achieved by acoustic and linguistic features are shown to be additive.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Improved spoken term detection using support vector machines with acoustic and context features from pseudo-relevance feedback

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

A method using linguistic and acoustic features to detect inadequate utterances in medical communication
Michihisa Kurisu ... Ryunosuke Wada
-
Michihisa Kurisu, et. al.Michihisa Kurisu ... Ryunosuke Wada
01 Jul 2013
01 Jul 2013

When Siri Knows How You Feel: Study of Machine Learning in Automatic Sentiment Recognition from Human Speech
L Zhang ... E Y K Ng
-
L Zhang, et. al.L Zhang ... E Y K Ng
27 Dec 2018
27 Dec 2018

Automatic speech analysis for detecting cognitive decline of older adults.
Lihe Huang ... Jingjing Yang
Frontiers in public health | VOL. 12
Lihe Huang, et. al.Lihe Huang ... Jingjing Yang
01 Jan 2024
Frontiers in public health | VOL. 12

Emotion Recognition Combining Acoustic and Linguistic Features Based on Speech Recognition Results
Misaki Sakurai ... Tetsuo Kosaka
-
Misaki Sakurai, et. al.Misaki Sakurai ... Tetsuo Kosaka
12 Oct 2021
12 Oct 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improved spoken term detection using support vector machines with acoustic and context features from pseudo-relevance feedback

Abstract

Talk to us

Similar Papers