Multi-Channel Target Speech Extraction with Channel Decorrelation and Target Speaker Adaptation

Jiangyu Han, Xinyuan Zhou, Yijie Li, Yanhua Long

doi:10.1109/icassp39728.2021.9414244

Abstract

End-to-end approaches to single-channel target speech extraction have attracted widespread attention. However, studies of end-to-end multi-channel target speech extraction remain relatively limited. In this work, we propose two methods that exploit multi-channel spatial information to extract the target speech. The first uses a target speech adaptation layer in a parallel encoder architecture. The second introduces a channel decorrelation mechanism that extracts inter-channel differential information to enhance the multi-channel encoder representation. We compare the proposed methods with two strong state-of-the-art baselines. Experimental results on the multi-channel reverberant WSJ0 2-mix dataset show that the proposed methods achieve relative improvements of up to 11.2% in SDR and 11.5% in SiSDR, which, to the best of our knowledge, are the best results reported on this task.
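The abstract reports its gains as relative improvements in SDR and SiSDR. As a rough illustration of how such figures are typically computed, the sketch below implements the widely used scale-invariant SDR definition together with a simple percentage-improvement helper. It is not taken from the paper: the function names `si_sdr` and `relative_improvement`, the mean removal step, the `eps` constant, and the formula for the relative improvement are all assumptions made for illustration, and the authors' evaluation code may differ.

```python
import numpy as np


def si_sdr(estimate, reference, eps=1e-8):
    """Scale-invariant SDR in dB (common definition; an assumption, not the paper's code)."""
    estimate = np.asarray(estimate, dtype=np.float64)
    reference = np.asarray(reference, dtype=np.float64)
    # Remove the mean so that a DC offset does not affect the score.
    estimate = estimate - estimate.mean()
    reference = reference - reference.mean()
    # Project the estimate onto the reference to get the optimally scaled target.
    alpha = np.dot(estimate, reference) / (np.dot(reference, reference) + eps)
    target = alpha * reference
    # Everything in the estimate that is not explained by the scaled reference.
    noise = estimate - target
    return 10.0 * np.log10((np.dot(target, target) + eps) / (np.dot(noise, noise) + eps))


def relative_improvement(baseline_db, proposed_db):
    """Relative improvement in percent of a proposed score over a baseline score.

    Assumed formula: 100 * (proposed - baseline) / |baseline|.
    """
    return 100.0 * (proposed_db - baseline_db) / abs(baseline_db)


if __name__ == "__main__":
    # Synthetic signals only; these are not results from the paper.
    rng = np.random.default_rng(0)
    reference = rng.standard_normal(16000)
    estimate = reference + 0.1 * rng.standard_normal(16000)
    print(f"SI-SDR of the synthetic estimate: {si_sdr(estimate, reference):.2f} dB")
    print(f"Relative improvement from 10.0 dB to 11.0 dB: {relative_improvement(10.0, 11.0):.1f}%")
```

Because the estimate is projected onto the reference before the ratio is taken, rescaling the estimate leaves the score unchanged, which is the property that distinguishes SiSDR from plain SDR.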

Similar Papers

Improving Channel Decorrelation for Multi-Channel Target Speech Extraction
Jiangyu Han ... Yanhua Long
30 Aug 2021

X-SEPFORMER: End-To-End Speaker Extraction Network with Explicit Optimization on Speaker Confusion
Kai Liu ... Ziqing Du
04 Jun 2023

Attention-Based Scaling Adaptation for Target Speech Extraction
Jiangyu Han ... Wei Rao
13 Dec 2021

Analysis of Impact of Emotions on Target Speech Extraction and Speech Separation
Jan Svec ... Katerina Zmolikova
05 Sep 2022
