Convex weighting criteria for speaking rate estimation.

Yishan Jiao,Ming Tu,Julie Liss,Visar Berisha

doi:10.1109/taslp.2015.2434213

Abstract

Speaking rate estimation directly from the speech waveform is a long-standing problem in speech signal processing. In this paper, we pose the speaking rate estimation problem as that of estimating a temporal density function whose integral over a given interval yields the speaking rate within that interval. In contrast to many existing methods, we avoid the more difficult task of detecting individual phonemes within the speech signal and we avoid heuristics such as thresholding the temporal envelope to estimate the number of vowels. Rather, the proposed method aims to learn an optimal weighting function that can be directly applied to time-frequency features in a speech signal to yield a temporal density function. We propose two convex cost functions for learning the weighting functions and an adaptation strategy to customize the approach to a particular speaker using minimal training. The algorithms are evaluated on the TIMIT corpus, on a dysarthric speech corpus, and on the ICSI Switchboard spontaneous speech corpus. Results show that the proposed methods outperform three competing methods on both healthy and dysarthric speech. In addition, for spontaneous speech rate estimation, the result show a high correlation between the estimated speaking rate and ground truth values.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Convex weighting criteria for speaking rate estimation.

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM transactions on audio, speech, and language processing

Lead the way for us

Journal: IEEE/ACM transactions on audio, speech, and language processing	Publication Date: Jul 23, 2015
Citations: 26

Similar Papers

Zero Resource Speaking Rate Estimation from Change Point Detection of Syllable-like Units
Shekhar Nayak ... Saurabhchand Bhati
-
Shekhar Nayak, et. al.Shekhar Nayak ... Saurabhchand Bhati
01 May 2019
01 May 2019

Preliminary Speech Rate Normative Data in Adult Jordanian Speakers
Mohammad A Damhoureyeh ... Yaser S Natour
Journal of Language Teaching and Research | VOL. 11
Mohammad A Damhoureyeh, et. al.Mohammad A Damhoureyeh ... Yaser S Natour
01 Mar 2020
Journal of Language Teaching and Research | VOL. 11

Characteristics of Speaking Rate in the Dysarthria Associated With Amyotrophic Lateral Sclerosis
Greg S Turner ... Gary Weismer
Journal of Speech, Language, and Hearing Research | VOL. 36
Greg S Turner, et. al.Greg S Turner ... Gary Weismer
01 Dec 1993
Journal of Speech, Language, and Hearing Research | VOL. 36

Classification of Bisyllabic Lexical Stress Patterns Using Deep Neural Networks
Mostafa Shahin ... Beena Ahmed
-
Mostafa Shahin, et. al.Mostafa Shahin ... Beena Ahmed
01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Convex weighting criteria for speaking rate estimation.

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM transactions on audio, speech, and language processing