Detecting Vocal Fatigue with Neural Embeddings.

Sebastian P Bayerl,Tobias Bocklet,Korbinian Riedhammer,Ilja Baumann,Dominik Wagner

doi:10.1016/j.jvoice.2023.01.012

Abstract

Vocal fatigue refers to the feeling of tiredness and weakness of voice due to extended utilization. This paper investigates the effectiveness of neural embeddings for the detection of vocal fatigue. We compare x-vectors, ECAPA-TDNN, and wav2vec 2.0 embeddings on a corpus of academic spoken English. Low-dimensional mappings of the data reveal that neural embeddings capture information about the change in vocal characteristics of a speaker during prolonged voice usage. We show that vocal fatigue can be reliably predicted using all three types of neural embeddings after 40 minutes of continuous speaking when temporal smoothing and normalization are applied to the extracted embeddings. We employ support vector machines for classification and achieve accuracy scores of 81% using x-vectors, 85% using ECAPA-TDNN embeddings, and 82% using wav2vec 2.0 embeddings as input features. We obtain an accuracy score of 76%, when the trained system is applied to a different speaker and recording environment without any adaptation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Detecting Vocal Fatigue with Neural Embeddings.

Abstract

Talk to us

Similar Papers

More From: Journal of voice : official journal of the Voice Foundation

Lead the way for us

Journal: Journal of voice : official journal of the Voice Foundation	Publication Date: Feb 1, 2023
Citations: 4

Similar Papers

Accent in Speech Samples: Support Vector Machines for Classification and Rule Extraction
Carol Pedersen ... Joachim Diederich
-
Carol Pedersen, et. al.Carol Pedersen ... Joachim Diederich
01 Jan 2008
01 Jan 2008

A Contrastive Study of Boosters in a Corpus of Academic Spoken English
Ali Khudhair Abd Oun Wazni ... Sara Mansouri
BELT - Brazilian English Language Teaching Journal | VOL. 14
Ali Khudhair Abd Oun Wazni, et. al.Ali Khudhair Abd Oun Wazni ... Sara Mansouri
12 Dec 2023
BELT - Brazilian English Language Teaching Journal | VOL. 14

Classification of Vocal Fatigue Using sEMG: Data Imbalance, Normalization, and the Role of Vocal Fatigue Index Scores
Yixiang Gao ... Maria Dietrich
Applied Sciences | VOL. 11
Yixiang Gao, et. al.Yixiang Gao ... Maria Dietrich
11 May 2021
Applied Sciences | VOL. 11

Preliminary Study on the Quantitative Analysis of Vocal Loading Effects on Vocal Fold Dynamics Using Phonovibrograms
Joerg Lohscheller ... Andrew J Mcwhorter
Annals of Otology, Rhinology & Laryngology | VOL. 117
Joerg Lohscheller, et. al.Joerg Lohscheller ... Andrew J Mcwhorter
01 Jul 2008
Annals of Otology, Rhinology & Laryngology | VOL. 117

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Detecting Vocal Fatigue with Neural Embeddings.

Abstract

Talk to us

Similar Papers

More From: Journal of voice : official journal of the Voice Foundation