Enhancing and Adversarial: Improve ASR with Speaker Labels

Wei Zhou, Haotian Wu, Ralf Schlüter, Mohammad Zeineldeen, Christoph Lüscher, Jingjing Xu, Hermann Ney

doi:10.1109/icassp49357.2023.10096722

Abstract

ASR can be improved by multi-task learning (MTL) with domain enhancing or domain adversarial training. These are two opposite objectives: enhancing training aims to increase domain variance towards domain-aware ASR, whereas adversarial training aims to decrease it towards domain-agnostic ASR. In this work, we study how best to apply these two opposite objectives with speaker labels to improve conformer-based ASR. We also propose a novel adaptive gradient reversal layer for stable and effective adversarial training without tuning effort. Detailed analysis and experimental verification are conducted to show the optimal positions in the ASR neural network (NN) at which to apply speaker enhancing and adversarial training. We also explore their combination for further improvement, achieving the same performance as i-vectors plus adversarial training. Our best speaker-based MTL achieves 7% relative improvement on the Switchboard Hub5’00 set. We also investigate the effect of such speaker-based MTL with respect to a cleaner dataset and a weaker ASR NN.
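
The difference between the two objectives can be illustrated by how the speaker-classification gradient reaches the shared encoder. The following is a minimal sketch in plain Python, assuming a standard gradient reversal layer with a fixed scale; it is not the adaptive layer proposed in the paper, whose details are not given in the abstract. The names `GradientReversal`, `encoder_gradient`, `scale`, and `weight` are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class GradientReversal:
    """Identity in the forward pass; multiplies gradients by -scale in the backward pass.

    Assumption: `scale` is a fixed, hand-chosen constant. The paper's adaptive
    variant is not reproduced here.
    """
    scale: float = 1.0

    def forward(self, x):
        # Activations pass through unchanged.
        return list(x)

    def backward(self, grad_output):
        # Gradients are negated and scaled on the way back to the encoder.
        return [-self.scale * g for g in grad_output]


def encoder_gradient(grad_asr, grad_spk, weight, mode):
    """Combine ASR and speaker-loss gradients arriving at a shared encoder output.

    mode="enhancing":   the speaker gradient is added (encoder becomes more speaker-aware).
    mode="adversarial": the speaker gradient is reversed (encoder becomes more speaker-agnostic).
    """
    if mode == "enhancing":
        spk = [weight * g for g in grad_spk]
    elif mode == "adversarial":
        spk = GradientReversal(scale=weight).backward(grad_spk)
    else:
        raise ValueError(f"unknown mode: {mode}")
    return [a + s for a, s in zip(grad_asr, spk)]
```

In both modes the speaker classifier itself is trained to predict the speaker label; only the sign of the gradient passed on to the shared layers differs.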

Similar Papers

Increasing-Margin Adversarial (IMA) training to improve adversarial robustness of neural networks
Linhai Ma ... Liang Liang
Computer Methods and Programs in Biomedicine | VOL. 240
24 Jun 2023

ART-Point: Improving Rotation Robustness of Point Cloud Classifiers via Adversarial Rotation
Ruibin Wang ... Dacheng Tao
01 Jun 2022

A Domain Adaptive Adversarial Training Method Based on Self-Supervised Learning
Chuqing Sun
01 Aug 2022

A Novel Adversarial Training Scheme for Deep Neural Network based Speech Enhancement
Samuele Cornell ... Stefano Squartini
01 Jul 2020
