Laughter Detection Research Articles

It is essential for the advancement of human-centered multimodal interfaces to be able to infer the current user's state or communication state. In order to enable a system to do that, the recognition and interpretation of multimodal social signals (i.e., paralinguistic and nonverbal behavior) in real-time applications is required. Since we believe that laughs are one of the most important and widely understood social nonverbal signals indicating affect and discourse quality, we focus in this work on the detection of laughter in natural multiparty discourses. The conversations are recorded in a natural environment without any specific constraint on the discourses using unobtrusive recording devices. This setup ensures natural and unbiased behavior, which is one of the main foci of this work. To compare results of methods, namely Gaussian Mixture Model (GMM) supervectors as input to a Support Vector Machine (SVM), so-called Echo State Networks (ESN), and a Hidden Markov Model (HMM) approach, are utilized in online and offline detection experiments. The SVM approach proves very accurate in the offline classification task, but is outperformed by the ESN and HMM approach in the online detection (F 1 scores: GMM SVM 0.45, ESN 0.63, HMM 0.72). Further, we were able to utilize the proposed HMM approach in a cross-corpus experiment without any retraining with respectable generalization capability (F 1 score: 0.49). The results and possible reasons for these outcomes are shown and discussed in the article. The proposed methods may be directly utilized in practical tasks such as the labeling or the online detection of laughter in conversational data and affect-aware applications.

Emotions can be recognized by audible paralinguistic cues in speech. By detecting these paralinguistic cues that can consist of laughter, a trembling voice, coughs, changes in the intonation contour etc., information about the speaker’s state and emotion can be revealed. This paper describes the development of a gender-independent laugh detector with the aim to enable automatic emotion recognition. Different types of features (spectral, prosodic) for laughter detection were investigated using different classification techniques (Gaussian Mixture Models, Support Vector Machines, Multi Layer Perceptron) often used in language and speaker recognition. Classification experiments were carried out with short pre-segmented speech and laughter segments extracted from the ICSI Meeting Recorder Corpus (with a mean duration of approximately 2 s). Equal error rates of around 3% were obtained when tested on speaker-independent speech data. We found that a fusion between classifiers based on Gaussian Mixture Models and classifiers based on Support Vector Machines increases discriminative power. We also found that a fusion between classifiers that use spectral features and classifiers that use prosodic information usually increases the performance for discrimination between laughter and speech. Our acoustic measurements showed differences between laughter and speech in mean pitch and in the ratio of the durations of unvoiced to voiced portions, which indicate that these prosodic features are indeed useful for discrimination between laughter and speech.

Laughter Detection Research Articles

Related Topics

Articles published on Laughter Detection

Impact of Annotation Modality on Label Quality and Model Performance in the Automatic Assessment of Laughter In-the-Wild

Cascade of Boolean detector combinations

Audio-Facial Laughter Detection in Naturalistic Dyadic Conversations

Laughter Classification Using Deep Rectifier Neural Networks with a Minimal Feature Subset

Automated Laughter Detection From Full-Body Movements

Quantitative Laughter Detection, Measurement, and Classification-A Critical Survey.

1A1-S06 笑い声に応答するロボットのための笑い声認識システムの開発

Detection of Hidden Laughter for Human-agent Interaction

Laughter Detection for an Assisting Tool of Group Conversation

Spotting laughter in natural multiparty conversations

The phraseological answer is one of the methods of humorous discussion (being based on the material E. Gusala’s writing andO. Ilchenko’s writings)

Automatic discrimination between laughter and speech

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Laughter Detection Research Articles

Related Topics

Articles published on Laughter Detection

Impact of Annotation Modality on Label Quality and Model Performance in the Automatic Assessment of Laughter In-the-Wild

Cascade of Boolean detector combinations

Audio-Facial Laughter Detection in Naturalistic Dyadic Conversations

Laughter Classification Using Deep Rectifier Neural Networks with a Minimal Feature Subset

Automated Laughter Detection From Full-Body Movements

Quantitative Laughter Detection, Measurement, and Classification-A Critical Survey.

1A1-S06 笑い声に応答するロボットのための笑い声認識システムの開発

Detection of Hidden Laughter for Human-agent Interaction

Laughter Detection for an Assisting Tool of Group Conversation

Spotting laughter in natural multiparty conversations

The phraseological answer is one of the methods of humorous discussion (being based on the material E. Gusala’s writing andO. Ilchenko’s writings)

Automatic discrimination between laughter and speech