Temporal Compression Of Speech: An Evaluation

S Tucker,S Whittaker

doi:10.1109/tasl.2008.916527

Abstract

Efficient browsing of speech recordings is problematic. The linear nature of speech, coupled with the lack of abstraction that the medium affords, means that listeners have to listen to long segments of a recording to locate points of interest. We explore temporal compression algorithms that attempt to reduce the amount of time users require to listen to speech recordings, while retaining the important content. This paper implements two main approaches to temporal compression: artificial speech rate alteration (speed-up) and unimportant segment removal (excision). We evaluate the effectiveness of these approaches by having listeners rate comprehension and listening effort for different types of temporal compression. For different compression levels, we compare performance of various implementations of speed-up and excision as well as techniques based on semantic features and acoustic features. Our results indicate that listeners prefer low compression levels, excision over speed-up, and algorithms based on semantic rather than acoustic features. Finally, listeners were negative about hybrid algorithms that used speed-up to indicate missing regions in an excised recording.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Temporal Compression Of Speech: An Evaluation

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech, and Language Processing

Lead the way for us

Journal: IEEE Transactions on Audio, Speech, and Language Processing	Publication Date: May 1, 2008
Citations: 35

Similar Papers

Speaker-listener neural coupling correlates with semantic and acoustic features of naturalistic speech.
Zhuoran Li ... Dan Zhang
Social cognitive and affective neuroscience | VOL. 19
Zhuoran Li, et. al.Zhuoran Li ... Dan Zhang
16 Jul 2024
Social cognitive and affective neuroscience | VOL. 19

The effects of speech masking on neural tracking of acoustic and semantic features of natural speech
Sonia Yasmin ... Björn Herrmann
Neuropsychologia | VOL. 186
Sonia Yasmin, et. al.Sonia Yasmin ... Björn Herrmann
09 May 2023
Neuropsychologia | VOL. 186

Combining semantic and acoustic features for valence and arousal recognition in speech
Seliz Gulsen Karadogan ... Jan Larsen
-
Seliz Gulsen Karadogan, et. al.Seliz Gulsen Karadogan ... Jan Larsen
01 May 2012
01 May 2012

Multi-modal activity and dominance detection in smart meeting rooms
Benedikt Hornler ... Gerhard Rigoll
-
Benedikt Hornler, et. al.Benedikt Hornler ... Gerhard Rigoll
01 Apr 2009
01 Apr 2009

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Temporal Compression Of Speech: An Evaluation

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Audio, Speech, and Language Processing