On the Effects of Automatic Transcription and Segmentation Errors in Hungarian Spoken Language Processing

György Szaszák,Máté Ákos Tündik,Valér Kaszás

doi:10.3311/ppee.14052

Abstract

Emerging Artificial Intelligence (AI) technology has brought machines to reach an equal or even superior level compared to human capabilities in several fields; nevertheless, among many other fields, making a computer able to understand human language still remains a challenge. When dealing with speech understanding, Automatic Speech Recognition (ASR) is used to generate transcripts, which are processed with text-based tools targeting Spoken Language Understanding (SLU). Depending on the ASR quality (which further depends on speech quality, the complexity of the topic, environment etc.), transcripts contain errors, which propagate further into the processing pipeline. Subjective tests show on the other hand, that humans understand quite well ASR-closed captions, despite the word and punctuation errors. Through word embedding based semantic parsing, the present paper is interested in quantifying the semantic bias introduced by ASR error propagation. As a special use case, speech summarization is also evaluated with regard to ASR error propagation. We show, that despite the higher word error rates seen with the highly inflectional Hungarian, the semantic space suffers least impact than the difference in Word Error Rate would suggest.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

On the Effects of Automatic Transcription and Segmentation Errors in Hungarian Spoken Language Processing

Abstract

Talk to us

Similar Papers

More From: Periodica Polytechnica Electrical Engineering and Computer Science

Lead the way for us

Similar Papers

Benefits of pre-trained mono- and cross-lingual speech representations for spoken language understanding of Dutch dysarthric speech
Pu Wang ... Hugo Van Hamme
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2023
Pu Wang, et. al.Pu Wang ... Hugo Van Hamme
07 Apr 2023
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2023

Joint Spoken Language Understanding and Domain Adaptive Language Modeling
Huifeng Zhang ... Shuai Fan
-
Huifeng Zhang, et. al.Huifeng Zhang ... Shuai Fan
01 Jan 2018
01 Jan 2018

Voice to Action: Spoken Language Understanding for Memory-Constrained Systems
Ashutosh Gupta ... Shatrughan Singh
-
Ashutosh Gupta, et. al.Ashutosh Gupta ... Shatrughan Singh
13 Dec 2021
13 Dec 2021

Analyzing the Effects of Transcription Errors on Summary Generation of Bengali Spoken Documents
Priyanjana Chowdhury ... Utpal Sharma
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. -
Priyanjana Chowdhury, et. al.Priyanjana Chowdhury ... Utpal Sharma
17 Jul 2024
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

On the Effects of Automatic Transcription and Segmentation Errors in Hungarian Spoken Language Processing

Abstract

Talk to us

Similar Papers

More From: Periodica Polytechnica Electrical Engineering and Computer Science