Semi-Supervised Deep Time-Delay Embedded Clustering for Stress Speech Analysis

Barlian Henryranu Prasetio,Hiroki Tamura,Koichi Tanno

doi:10.3390/electronics8111263

Barlian Henryranu Prasetio, Hiroki Tamura + Show 1 more

Open Access

https://doi.org/10.3390/electronics8111263

Copy DOI

Journal: Electronics	Publication Date: Nov 1, 2019
Citations: 5	License type: CC BY 4.0

Affiliation: University of Miyazaki

Abstract

Real stressed speech is affected by various aspects (individual characteristics and environment) so that the stress patterns are diverse and different on each individual. To this end, in our previous work, we performed an unsupervised clustering method that able to self-learning manner by mapping the feature representations of the stress speech and clustering tasks simultaneously, called deep time-delay embedded clustering (DTEC). However, DTEC has not confirmed yet the compatibility between the output class and informational classes. Therefore, we proposed semi-supervised time-delay embedded clustering (SDTEC) as a new framework of semi-supervised in DTEC. SDTEC incorporates the prior information of pairwise constraints in the embedding layer and simultaneously learns the feature representation and the clustering assignments. The prior information was used to guide the clustering procedure so that the points that belong to the incorrect cluster can be corrected. The effectiveness of the proposed SDTEC was evaluated by comparing it with some baseline methods in terms of the clustering error rate (CER). Moreover, to demonstrate SDTEC’s capabilities, we conducted a comprehensive ablation study. Based on experiment results, SDTEC outperformed the baseline methods and achieves state-of-the-art results in semi-supervised clustering.

Highlights

IntroductionStress is an unconscious emotion caused by environmental stimuli [1]
In psychological sciences, stress is an unconscious emotion caused by environmental stimuli [1].The human body responds to stress by releasing hormones that increase heart rates, breathing rates, and muscle tension [2]
We assess the effectiveness of the proposed supervised deep time-delay embedded clustering (SDTEC) in categorizing the stress speech data of Speech Under Simulated and Actual Stress (SUSAS) dataset in term of clustering error rate (CER)

Summary

Introduction

Stress is an unconscious emotion caused by environmental stimuli [1]. In real situations, stress characteristics are diverse and have different patterns for each individual due to various aspects such as characteristics, gender, experience background, and emotional tendencies [7] In this decade, unsupervised clustering has been explored by defining an effective objective in a self-learning manner to categorize stress speech data [8,9,10]. In our previous work [20], we proposed a new deep clustering architecture that uses the time-delay neural network (TDNN) structure to built the autoencoder. We named it the deep time-delay embedded clustering (DTEC).

Related Works

Semi-Supervised Deep Time-Delay Embedded Clustering

Nonlinear Transformation

Stress Speech Recognition Model Based Pairwise Constraints

Objective Function of the Network

Dataset

Experiment Settings

Baseline Clustering Methods

Results and Discussion

Evaluation Result

Method

Ablation Study

The Effect of Losses

The Effect of the Number of Constraints

Conclusions

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Semi-Supervised Deep Time-Delay Embedded Clustering for Stress Speech Analysis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Electronics

Lead the way for us

Similar Papers

A review on semi-supervised clustering
Jianghui Cai ... Yuqing Yang
Information Sciences | VOL. 632
Jianghui Cai, et. al.Jianghui Cai ... Yuqing Yang
05 Mar 2023
Information Sciences | VOL. 632

Research Progress on Semi-Supervised Clustering
Yue Qin ... Lijuan Wang
Cognitive Computation | VOL. 11
Yue Qin, et. al.Yue Qin ... Lijuan Wang
17 Jul 2019
Cognitive Computation | VOL. 11

Semi-supervised hierarchical ensemble clustering based on an innovative distance metric and constraint information
Baohua Shen ... Gholamreza Ahmadi
Engineering Applications of Artificial Intelligence | VOL. 124
Baohua Shen, et. al.Baohua Shen ... Gholamreza Ahmadi
12 Jun 2023
Engineering Applications of Artificial Intelligence | VOL. 124

Semi-supervised consensus clustering for gene expression data analysis.
Yunli Wang ... Youlian Pan
BioData mining | VOL. 7
Yunli Wang, et. al.Yunli Wang ... Youlian Pan
08 May 2014
BioData mining | VOL. 7

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Semi-Supervised Deep Time-Delay Embedded Clustering for Stress Speech Analysis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Electronics