Normal-rate to fast-rate speech conversion using non-linear compression maps

Michael D Fry,Eric Vatikiotis-Bateson

doi:10.1121/1.4969172

Abstract

This paper presents a new technique to convert normal-rate speech into intelligible fast-rate, speeded speech. Speeded speech has long been recognized for its potential to improve spoken media comprehension; however, current tools to significantly speed playback of non-text media are insufficient due to their reliance on inaccurate phoneme analysis. With the ever increasing amount of non-text media online, a method to speed playback that is agnostic of phonemes is needed. Our technique uses spectral and source components of the acoustics to generate a non-linear compression map that characterizes how conversational-rate speech signals are compressed to achieve analogue fast-rate speech signals. A data set containing conversational- and fast-rate speech pairs was processed to determine compression maps corresponding to each pair. A Recursive Neural Network (RNN) was trained on the set of normal-rate speech and the corresponding compression maps. The RNN was then used to generate compression maps for novel normal-rate speech and ultimately output a fast-rate speech signal. Elicited fast-rate speech and speeded speech conversions technique are now being compared perceptually for intelligibility and naturalness.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Normal-rate to fast-rate speech conversion using non-linear compression maps

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America

Lead the way for us

Journal: The Journal of the Acoustical Society of America	Publication Date: Oct 1, 2016
Citations: 1

Similar Papers

Effects of alterations in auditory feedback on stuttering frequency during fast and normal speech rates
J Kalinowski ... A Stuart
Journal of Fluency Disorders | VOL. 19
J Kalinowski, et. al.J Kalinowski ... A Stuart
01 Sep 1994
Journal of Fluency Disorders | VOL. 19

Stuttering inhibition via visual feedback at normal and fast speech rates
Daniel Hudock ... Vikram N Dayalu
International Journal of Language & Communication Disorders | VOL. 46
Daniel Hudock, et. al.Daniel Hudock ... Vikram N Dayalu
09 Jul 2010
International Journal of Language & Communication Disorders | VOL. 46

Stuttering amelioration at various auditory feedback delays and speech rates
Joseph Kalinowski ... Sarah Sark
International Journal of Language & Communication Disorders | VOL. 31
Joseph Kalinowski, et. al.Joseph Kalinowski ... Sarah Sark
01 Jul 1996
International Journal of Language & Communication Disorders | VOL. 31

Effect of Frequency-Altered Feedback on Stuttering Frequency at Normal and Fast Speech Rates
Stephanie Hargrave ... Andrew Stuart
Journal of Speech, Language, and Hearing Research | VOL. 37
Stephanie Hargrave, et. al.Stephanie Hargrave ... Andrew Stuart
01 Dec 1994
Journal of Speech, Language, and Hearing Research | VOL. 37

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Normal-rate to fast-rate speech conversion using non-linear compression maps

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America