Stressed speech processing: Human vs automatic in non-professional speakers scenario

Sumitra Shukla,S Dandapat,S R M Prasanna

doi:10.1109/ncc.2011.5734704

Abstract

This study analyzes the effect of stress in human and automatic stressed speech processing tasks for speech collected from non-professional speakers. The database of 33 keywords is collected under five stress conditions, namely, neutral, angry, happy, sad and Lombard from fifteen speakers. The first study is to understand the ability to identify stress by human and automatic speech processing. The average performance of human stress classification is 59.44%. The average performance of automatic stress classifier using Vector Quantization (VQ) and Hidden Markov model (HMM) is 54.65% and 56.02%, respectively. The second study has been done to understand the effect of stress in human and automatic speech recognition. The average performance of human stressed speech recognition is 99.60%. The automatic stressed speech recognition performance using VQ and HMM is 82.42% and 76.79%, respectively. Even in the non-professional speakers scenario, human performance is better than automatic processing. Also, automatic processing seem to show considerable degradation in performance that warrant development of new methods to handle stress information.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Stressed speech processing: Human vs automatic in non-professional speakers scenario

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Novel speech processing techniques for robust automatic speech recognition

-

01 Jan 2006
01 Jan 2006

Discrete-Mixture HMMs-based Approach for Noisy Speech Recognition
Tetsuo Kosaka ... Masaki Koh
-
Tetsuo Kosaka, et. al.Tetsuo Kosaka ... Masaki Koh
01 Jun 2007
01 Jun 2007

N-channel hidden Markov models for combined stressed speech classification and recognition
B.D Womack ... J.H.L Hansen
IEEE Transactions on Speech and Audio Processing | VOL. 7
B.D Womack, et. al.B.D Womack ... J.H.L Hansen
01 Jan 1998
IEEE Transactions on Speech and Audio Processing | VOL. 7

Early Decision Making in Continuous Speech
Odette Scharenborg ... Lou Boves
-
Odette Scharenborg, et. al.Odette Scharenborg ... Lou Boves
01 Jun 2007
01 Jun 2007

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Stressed speech processing: Human vs automatic in non-professional speakers scenario

Abstract

Talk to us

Similar Papers