Attention mechanism based LSTM in classification of stressed speech under workload

Xiao Yao,Min Gu,Haibin Wang,Zhengyan Sheng,Ning Xu,Xiaofeng Liu

doi:10.3233/ida-205429

Abstract

In order to improve the robustness of speech recognition systems, this study attempts to classify stressed speech caused by the psychological stress under multitasking workloads. Due to the transient nature and ambiguity of stressed speech, the stress characteristics is not represented in all the segments in stressed speech as labeled. In this paper, we propose a multi-feature fusion model based on the attention mechanism to measure the importance of segments for stress classification. Through the attention mechanism, each speech frame is weighted to reflect the different correlations to the actual stressed state, and the multi-channel fusion of features characterizing the stressed speech to classify the speech under stress. The proposed model further adopts SpecAugment in view of the feature spectrum for data augment to resolve small sample sizes problem among stressed speech. During the experiment, we compared the proposed model with traditional methods on CASIA Chinese emotion corpus and Fujitsu stressed speech corpus, and results show that the proposed model has better performance in speaker-independent stress classification. Transfer learning is also performed for speaker-dependent classification for stressed speech, and the performance is improved. The attention mechanism shows the advantage for continuous speech under stress in authentic context comparing with traditional methods.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Attention mechanism based LSTM in classification of stressed speech under workload

Abstract

Talk to us

Similar Papers

More From: Intelligent Data Analysis

Lead the way for us

Similar Papers

Classification of speech under stress based on features derived from the nonlinear Teager energy operator
Guojun Zhou ... J.H.L Hansen
-
Guojun Zhou, et. al. Guojun Zhou ... J.H.L Hansen
12 May 1998
12 May 1998

Vector Quantization Approach to Classification of Stressed Speech
S Ramamohan ... S Dandapat
IETE Journal of Research | VOL. 50
S Ramamohan, et. al.S Ramamohan ... S Dandapat
01 Jul 2004
IETE Journal of Research | VOL. 50

N-channel hidden Markov models for combined stressed speech classification and recognition
B.D Womack ... J.H.L Hansen
IEEE Transactions on Speech and Audio Processing | VOL. 7
B.D Womack, et. al.B.D Womack ... J.H.L Hansen
01 Jan 1998
IEEE Transactions on Speech and Audio Processing | VOL. 7

Stressed speech recognition using multi-dimensional hidden Markov models
B.D Womack ... J.H.L Hansen
-
B.D Womack, et. al.B.D Womack ... J.H.L Hansen
14 Dec 1997
14 Dec 1997

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Attention mechanism based LSTM in classification of stressed speech under workload

Abstract

Talk to us

Similar Papers

More From: Intelligent Data Analysis