Abstract

Objective: Voice problems that arise during everyday vocal use are difficult to capture with standard outpatient voice assessments. In preparation for a digital health application to automatically assess longitudinal voice data 'in the wild' (the VocDoc), the aim of this paper was to study vocal fatigue from the speaker's perspective, the healthcare professional's perspective, and the 'machine's' perspective.

Methods: We collected data from four vocally healthy speakers completing a 90-min reading task. Every 10 min, the speakers were asked about subjective voice characteristics. We then elaborated on the task of elapsed speaking time recognition: we carried out listening experiments with speech and language therapists and trained random forests on extracted acoustic features. We validated our models speaker-dependently and speaker-independently and analysed the underlying feature importances. For an additional scenario oriented towards clinical application, we extended our dataset with lecture recordings of two further speakers.

Results: Self- and expert assessments were not consistent. With mean F1 scores of up to 0.78, automatic elapsed speaking time recognition worked reliably only in the speaker-dependent scenario. A small set of acoustic features, other than the features previously reported to reflect vocal fatigue, was found to universally describe long-term variations of the voice.

Conclusion: Vocal fatigue appears to affect individual speakers differently. Machine learning has the potential to automatically detect and characterise vocal changes over time.

Significance: Our study provides the technical underpinnings for a future mobile solution that objectively captures pathological long-term voice variations in everyday life settings and makes them clinically accessible.
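To illustrate the evaluation setup named in the Methods, the sketch below shows how elapsed speaking time recognition with random forests could be validated speaker-independently via leave-one-speaker-out splits. This is a minimal sketch using scikit-learn with synthetic data; the feature matrix, segment labels, and model settings are placeholder assumptions, not the study's actual pipeline.

```python
# Minimal sketch: random forest on acoustic features with
# speaker-independent (leave-one-speaker-out) validation.
# All data here is synthetic; the real study extracts acoustic
# features from 10-min segments of a 90-min reading task.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import LeaveOneGroupOut, cross_val_score

rng = np.random.default_rng(0)

n_speakers, segments_per_speaker, n_features = 4, 9, 20
X = rng.normal(size=(n_speakers * segments_per_speaker, n_features))
# Each 10-min segment is labelled with its elapsed-speaking-time class (0..8).
y = np.tile(np.arange(segments_per_speaker), n_speakers)
# Speaker id per segment, used to hold out one speaker per fold.
groups = np.repeat(np.arange(n_speakers), segments_per_speaker)

clf = RandomForestClassifier(n_estimators=200, random_state=0)

# Speaker-independent: train on three speakers, test on the held-out one.
scores = cross_val_score(
    clf, X, y, groups=groups, cv=LeaveOneGroupOut(), scoring="f1_macro"
)
print("macro F1 per held-out speaker:", scores)

# Feature importances indicate which acoustic features carry
# information about elapsed speaking time.
clf.fit(X, y)
print("top features:", np.argsort(clf.feature_importances_)[::-1][:5])
```

A speaker-dependent evaluation would instead split each speaker's own segments into train and test portions, which is where the reported F1 scores of up to 0.78 were obtained.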
