Acoustic compression in Zoom audio does not compromise voice recognition performance

Valeriia Perepelytsia,Volker Dellwo

doi:10.1038/s41598-023-45971-x

Abstract

Human voice recognition over telephone channels typically yields lower accuracy when compared to audio recorded in a studio environment with higher quality. Here, we investigated the extent to which audio in video conferencing, subject to various lossy compression mechanisms, affects human voice recognition performance. Voice recognition performance was tested in an old–new recognition task under three audio conditions (telephone, Zoom, studio) across all matched (familiarization and test with same audio condition) and mismatched combinations (familiarization and test with different audio conditions). Participants were familiarized with female voices presented in either studio-quality (N = 22), Zoom-quality (N = 21), or telephone-quality (N = 20) stimuli. Subsequently, all listeners performed an identical voice recognition test containing a balanced stimulus set from all three conditions. Results revealed that voice recognition performance (dʹ) in Zoom audio was not significantly different to studio audio but both in Zoom and studio audio listeners performed significantly better compared to telephone audio. This suggests that signal processing of the speech codec used by Zoom provides equally relevant information in terms of voice recognition compared to studio audio. Interestingly, listeners familiarized with voices via Zoom audio showed a trend towards a better recognition performance in the test (p = 0.056) compared to listeners familiarized with studio audio. We discuss future directions according to which a possible advantage of Zoom audio for voice recognition might be related to some of the speech coding mechanisms used by Zoom.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific Reports	Publication Date: Oct 31, 2023
Citations: 2	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Acoustic compression in Zoom audio does not compromise voice recognition performance

Abstract

Talk to us

Similar Papers

More From: Scientific Reports

Lead the way for us

Similar Papers

Voice Activity Detection Using an Improved Unvoiced Feature Normalization Process in Noisy Environments
Kyungyong Chung ... Sang Yeob Oh
Wireless Personal Communications | VOL. 89
Kyungyong Chung, et. al.Kyungyong Chung ... Sang Yeob Oh
31 Dec 2015
Wireless Personal Communications | VOL. 89

Recognizing people by their voices: An fMRI-study of healthy people and patients after stroke
Y Paelecke-Habermann ... C Gaul
Klinische Neurophysiologie | VOL. 39
Y Paelecke-Habermann, et. al.Y Paelecke-Habermann ... C Gaul
01 Mar 2008
Klinische Neurophysiologie | VOL. 39

Evaluation of AI System’s Voice Recognition Performance in Social Conversation
Sweta Kumari Barnwal ... Pooja Gupta
-
Sweta Kumari Barnwal, et. al.Sweta Kumari Barnwal ... Pooja Gupta
14 Dec 2022
14 Dec 2022

The Glasgow Voice Memory Test: Assessing the ability to memorize and recognize unfamiliar voices.
Virginia Aglieri ... Pascal Belin
Behavior Research Methods | VOL. 49
Virginia Aglieri, et. al.Virginia Aglieri ... Pascal Belin
28 Jan 2016
Behavior Research Methods | VOL. 49

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Acoustic compression in Zoom audio does not compromise voice recognition performance

Abstract

Talk to us

Similar Papers

More From: Scientific Reports