Where Are We in Semantic Concept Extraction for Spoken Language Understanding?

Sahar Ghannay,Bassam Jabaian,Yannick Estève,Salima Mdhaffar,Antoine Caubrière,Gaëlle Laperrière

doi:10.1007/978-3-030-87802-3_19

Abstract

Spoken language understanding (SLU) topic has seen a lot of progress these last three years, with the emergence of end-to-end neural approaches. Spoken language understanding refers to natural language processing tasks related to semantic extraction from speech signal, like named entity recognition from speech or slot filling task in a context of human-machine dialogue. Classically, SLU tasks were processed through a cascade approach that consists in applying, firstly, an automatic speech recognition process, followed by a natural language processing module applied to the automatic transcriptions. These three last years, end-to-end neural approaches, based on deep neural networks, have been proposed in order to directly extract the semantics from speech signal, by using a single neural model. More recent works on self-supervised training with unlabeled data open new perspectives in term of performance for automatic speech recognition and natural language processing. In this paper, we present a brief overview of the recent advances on the French MEDIA benchmark dataset for SLU, with or without the use of additional data. We also present our last results that significantly outperform the current state-of-the-art with a Concept Error Rate (CER) of 11.2%, instead of 13.6% for the last state-of-the-art system presented this year.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Where Are We in Semantic Concept Extraction for Spoken Language Understanding?

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Joint Spoken Language Understanding and Domain Adaptive Language Modeling
Huifeng Zhang ... Shuai Fan
-
Huifeng Zhang, et. al.Huifeng Zhang ... Shuai Fan
01 Jan 2018
01 Jan 2018

Bridging Speech and Textual Pre-Trained Models With Unsupervised ASR
Jiatong Shi ... Shinji Watanabe
-
Jiatong Shi, et. al.Jiatong Shi ... Shinji Watanabe
04 Jun 2023
04 Jun 2023

Benefits of pre-trained mono- and cross-lingual speech representations for spoken language understanding of Dutch dysarthric speech
Pu Wang ... Hugo Van Hamme
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2023
Pu Wang, et. al.Pu Wang ... Hugo Van Hamme
07 Apr 2023
EURASIP Journal on Audio, Speech, and Music Processing | VOL. 2023

Novel speech processing techniques for robust automatic speech recognition

-

01 Jan 2006
01 Jan 2006

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Where Are We in Semantic Concept Extraction for Spoken Language Understanding?

Abstract

Talk to us

Similar Papers