Abstract

Language models (LMs), commonly pre-trained on large corpora, have proven robust and effective for Natural Language Understanding (NLU) tasks in many applications such as virtual assistants and recommendation systems. These applications typically receive the output of an automatic speech recognition (ASR) module as spoken-form input, which generally lacks both lexical and syntactic information. Pre-trained language models such as BERT [1] and XLM-RoBERTa [2], which are usually pre-trained on written-form corpora, show degraded performance on NLU tasks with spoken-form inputs. In this paper, we propose a novel method to train a language model, named CapuBERT, that can handle spoken-form input from an ASR module. Experimental results show that the proposed model achieves state-of-the-art results on several NLU tasks, including part-of-speech tagging, named-entity recognition, and chunking, in English, German, and Vietnamese.
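To make the written-form/spoken-form gap concrete, the sketch below (an illustration under our own assumptions, not code from the paper) approximates ASR-style spoken-form text by stripping capitalization and punctuation, two surface cues that written-form pre-trained models rely on for tasks such as named-entity recognition. The function name to_spoken_form is hypothetical.

```python
import re

def to_spoken_form(text: str) -> str:
    """Approximate ASR-style spoken-form text from written-form text.

    Hypothetical preprocessing for illustration only: lowercase the
    text and strip punctuation. Real ASR output can differ further
    (e.g., spelled-out numbers), which this sketch does not model.
    """
    text = text.lower()                       # drop capitalization cues
    text = re.sub(r"[^\w\s]", "", text)       # drop punctuation cues
    return re.sub(r"\s+", " ", text).strip()  # normalize whitespace

print(to_spoken_form("Barack Obama visited Hanoi, Vietnam in 2016."))
# -> "barack obama visited hanoi vietnam in 2016"
```

On input like this, an entity such as "Hanoi" loses the capitalization that a written-form model would otherwise use as a strong NER signal, which is the degradation the abstract describes.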
