Abstract
Advances in information and communication technology, coupled with the increasing use of smartphones in web surveys, provide new avenues for collecting answers from respondents. Specifically, the microphones of smartphones facilitate the collection of voice instead of text answers to open questions. Speech-to-text transcription through Automatic Speech Recognition (ASR) systems offers an efficient way to make voice answers accessible to text-as-data methods. However, there is little evidence on the transcription performance of ASR systems when it comes to voice answers. We therefore investigate the performance of two leading ASR systems – Google’s Cloud Speech-to-Text API and OpenAI’s Whisper – using voice answers to two open questions administered in a smartphone survey in Germany. The results indicate that Whisper produces more accurate transcriptions than Google’s API. Both systems produce similar types of errors, but these errors are more common for the Google API. However, the Google API is faster than both Whisper and human transcribers.
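As context for the two systems compared above, the sketch below illustrates how a single voice answer might be transcribed with each of them. This is a minimal illustration, not the study's pipeline: the file name, sample rate, language code, and Whisper model size are assumptions, as the abstract does not state the settings actually used.

```python
# Minimal sketch: transcribing one voice answer with both ASR systems.
# Assumptions (not from the paper): a 16 kHz mono WAV file of a German
# voice answer named "answer.wav"; the "small" Whisper model.

# --- OpenAI Whisper (open-source package: pip install openai-whisper) ---
import whisper

model = whisper.load_model("small")            # model size is illustrative
whisper_result = model.transcribe("answer.wav", language="de")
print("Whisper:", whisper_result["text"])

# --- Google Cloud Speech-to-Text (pip install google-cloud-speech) ---
from google.cloud import speech

client = speech.SpeechClient()                 # requires GCP credentials to be configured
with open("answer.wav", "rb") as f:
    audio = speech.RecognitionAudio(content=f.read())
config = speech.RecognitionConfig(
    encoding=speech.RecognitionConfig.AudioEncoding.LINEAR16,
    sample_rate_hertz=16000,
    language_code="de-DE",
)
response = client.recognize(config=config, audio=audio)
for result in response.results:
    print("Google:", result.alternatives[0].transcript)
```

Note that Google's synchronous `recognize` call only handles recordings up to about one minute; longer answers would require the asynchronous `long_running_recognize` method instead.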