Air traffic control speech recognition system cross-task and speaker adaptation

R De Cordoba,J.M Pardo,J.M Montero,J Macias-Guarasa,J Ferreiros,L.F D'Haro,R San-Segundo,F Fernandez

doi:10.1109/maes.2006.1705165

Abstract

We present an overview of the most common techniques used in automatic speech recognition to adapt a general system to a different environment (known as cross-task adaptation) such as in an air traffic control system (ATC). The conditions present in ATC are very specific: very spontaneous, the presence of noise, and high speed speech. So, with a typical speech recognizer the recognition results are unsatisfactory. We have to decide on the best option for the modeling: to develop acoustic models specific to those conditions from scratch using the data available for the new environment, or to carry out cross-task adaptation starting from reliable HMM models (usually requiring less data in the target domain). We begin with a description of the main techniques considered for cross-task adaptation, namely maximum a posteriori (MAP), maximum likelihood linear regression (MLLR), and the two together. We have applied each in two speech recognizers for air traffic control tasks, one for spontaneous speech and the other for a command interface. We show the performance of these techniques and compare them with the development of a new system from scratch. We also show the results obtained for speaker adaptation using a variable amount of adaptation data. The main conclusion is that MLLR can outperform MAP when a large number of transforms is used, and MLLR followed by MAP is the best option. All of these techniques are better than developing a new system from scratch, showing the effectiveness of mean and variance adaptation

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Air traffic control speech recognition system cross-task and speaker adaptation

Abstract

Talk to us

Similar Papers

More From: IEEE Aerospace and Electronic Systems Magazine

Lead the way for us

Journal: IEEE Aerospace and Electronic Systems Magazine	Publication Date: Sep 1, 2006
Citations: 18

Similar Papers

Automation and Systems Issues in Air Traffic Control
John A Wise ... Marvin L Smith
-
John A Wise, et. al.John A Wise ... Marvin L Smith
01 Jan 1991
01 Jan 1991

Combining MAP and MLLR Approaches for SVM Based Speaker Recognition with a Multi-class MLLR Technique
Haipeng Wang ... Yonghong Yan
-
Haipeng Wang, et. al.Haipeng Wang ... Yonghong Yan
01 Dec 2009
01 Dec 2009

Effective speaker adaptations for speaker verification
Sungjoo Ahn ... Hanseok Ko
-
Sungjoo Ahn, et. al. Sungjoo Ahn ... Hanseok Ko
05 Jun 2000
05 Jun 2000

Speaker normalization and adaptation based on linear transformation
J Ishii ... M Tonomura
-
J Ishii, et. al.J Ishii ... M Tonomura
21 Apr 1997
21 Apr 1997

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Air traffic control speech recognition system cross-task and speaker adaptation

Abstract

Talk to us

Similar Papers

More From: IEEE Aerospace and Electronic Systems Magazine