Abstract

Automatic spoken instruction understanding (SIU) of controller-pilot conversations in air traffic control (ATC) requires not only recognizing the words and semantics of the speech but also identifying the role of the speaker. However, few published works on automatic understanding systems for air traffic communication focus on speaker role identification (SRI). In this article, we formulate the SRI task for controller-pilot communication as a binary classification problem and propose text-based, speech-based, and multi-modal speech-and-text methods to enable a comprehensive comparison on the task. To isolate the impact of each comparative approach, various advanced neural network architectures are applied to optimize the implementations of the text-based and speech-based methods. Most importantly, a multi-modal speaker role identification network (MMSRINet) is designed to solve the SRI task by considering both acoustic and textual modality features. To aggregate the modality features, a modal fusion module is proposed that fuses and squeezes the acoustic and textual representations with a modal attention mechanism and a self-attention pooling layer, respectively. Finally, the comparative approaches are validated on the ATCSpeech corpus, collected from a real-world ATC environment. The experimental results demonstrate that all the comparative approaches are effective for the SRI task, and that the proposed MMSRINet achieves competitive performance and robustness compared with the other methods on both seen and unseen data, with 98.56% and 98.08% accuracy, respectively.
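
To make the fusion step concrete, the sketch below illustrates one plausible reading of the modal fusion module: cross-modal ("modal") attention lets each modality attend to the other, and a self-attention pooling layer squeezes each fused sequence into a fixed-size vector before binary classification. All class names, dimensions, and wiring here are illustrative assumptions; the paper's actual MMSRINet architecture may differ.

```python
# Minimal PyTorch sketch of a modal fusion module, assuming 256-dim features
# for both modalities and a two-class (pilot vs. controller) output head.
import torch
import torch.nn as nn
import torch.nn.functional as F


class SelfAttentionPooling(nn.Module):
    """Squeeze a (batch, time, dim) sequence into (batch, dim) via learned weights."""

    def __init__(self, dim: int):
        super().__init__()
        self.score = nn.Linear(dim, 1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Softmax over the time axis yields attention weights; weighted sum pools.
        w = F.softmax(self.score(x), dim=1)   # (B, T, 1)
        return torch.sum(w * x, dim=1)        # (B, D)


class ModalFusion(nn.Module):
    """Fuse acoustic and textual sequences with cross-modal attention, then classify."""

    def __init__(self, dim: int = 256, heads: int = 4):
        super().__init__()
        self.speech_attends_text = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.text_attends_speech = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.pool_speech = SelfAttentionPooling(dim)
        self.pool_text = SelfAttentionPooling(dim)
        self.classifier = nn.Sequential(
            nn.Linear(2 * dim, dim), nn.ReLU(), nn.Linear(dim, 2)  # role logits
        )

    def forward(self, speech: torch.Tensor, text: torch.Tensor) -> torch.Tensor:
        # Each modality attends to the other (modal attention), then each fused
        # sequence is squeezed by self-attention pooling and concatenated.
        s, _ = self.speech_attends_text(speech, text, text)
        t, _ = self.text_attends_speech(text, speech, speech)
        fused = torch.cat([self.pool_speech(s), self.pool_text(t)], dim=-1)
        return self.classifier(fused)         # (B, 2)


# Toy usage: 8 utterances, 100 acoustic frames and 20 tokens, 256-dim features.
model = ModalFusion()
logits = model(torch.randn(8, 100, 256), torch.randn(8, 20, 256))
print(logits.shape)  # torch.Size([8, 2])
```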
