End-to-End Spoken Language Understanding Using Transformer Networks and Self-Supervised Pre-Trained Features

Edmilson Morais, Hong-Kwang J Kuo, Samuel Thomas, Zoltan Tuske, Brian Kingsbury

doi:10.1109/icassp39728.2021.9414522

Abstract

Transformer networks and self-supervised pre-training have consistently delivered state-of-the-art results in the field of natural language processing (NLP); however, their merits in the field of spoken language understanding (SLU) still need further investigation. In this paper, we introduce a modular End-to-End (E2E) SLU architecture based on transformer networks, which allows the use of self-supervised pre-trained acoustic features, pre-trained model initialization, and multi-task training. Several SLU experiments for predicting intent and entity labels/values are performed on the ATIS dataset. These experiments investigate the interaction of pre-trained model initialization and multi-task training with either traditional filterbank or self-supervised pre-trained acoustic features. Results show not only that self-supervised pre-trained acoustic features outperform filterbank features in almost all the experiments, but also that, when these features are used in combination with multi-task training, they almost eliminate the need for pre-trained model initialization.
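The abstract does not say how the tasks are combined during multi-task training. As a minimal sketch only, assuming the common approach of a weighted sum of per-task cross-entropy losses, the idea could look like the following. The task names, weights, and logits are hypothetical values chosen for illustration and are not taken from the paper.

```python
import math


def cross_entropy(logits, target):
    """Cross-entropy for one example: -log softmax(logits)[target]."""
    m = max(logits)  # subtract the max for numerical stability
    log_sum_exp = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_sum_exp - logits[target]


def multitask_loss(task_losses, task_weights):
    """Weighted sum of per-task losses.

    The weights are hypothetical hyperparameters; the abstract does not
    state how (or whether) the tasks are weighted.
    """
    if set(task_losses) != set(task_weights):
        raise ValueError("every task needs exactly one weight")
    return sum(task_weights[name] * loss for name, loss in task_losses.items())


# Toy example: one utterance with made-up logits for an intent head
# and an entity head (both heads are assumptions for illustration).
losses = {
    "intent": cross_entropy([2.0, 0.5, -1.0], target=0),
    "entity": cross_entropy([0.1, 0.2, 3.0, -0.5], target=2),
}
weights = {"intent": 0.5, "entity": 0.5}
total = multitask_loss(losses, weights)
```

In a setup like this, the gradient of `total` would update the parameters shared by both heads, which is one way multi-task training can regularize a model that was not initialized from a pre-trained one.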

Similar Papers

End-to-End Neural Transformer Based Spoken Language Understanding
Martin Radfar ... Athanasios Mouchtaris
25 Oct 2020

Uni-MIS: United Multiple Intent Spoken Language Understanding via Multi-View Intent-Slot Interaction
Shangjian Yin ... Peijie Huang
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 38
24 Mar 2024

Developments in The Field of Natural Language Processing
International Journal of Advanced Research in Computer Science | VOL. 8
30 Apr 2017

Scope and Challenges in Conversational AI using Transformer Models
Arighna Chakraborty ... Asoke Nath
International Journal of Scientific Research in Computer Science, Engineering and Information Technology
15 Dec 2021
