Deep Speech Extraction with Time-Varying Spatial Filtering Guided By Desired Direction Attractor

Yu Nakagome,Tetsuji Ogawa,Masahito Togami,Tetsunori Kobayashi

doi:10.1109/icassp40776.2020.9053629

Abstract

In this investigation, a deep neural network (DNN) based speech extraction method is proposed to enhance a speech signal propagating from the desired direction. The proposed method integrates knowledge based on a sound propagation model and the time-varying characteristics of a speech source, into a DNN-based separation framework. This approach outputs a separated speech source using time-varying spatial filtering, which achieves superior speech extraction performance compared with time-invariant spatial filtering. Given that the gradient of all modules can be calculated, back-propagation can be performed to maximize the speech quality of the output signal in an end-to-end manner. Guided information is also modeled based on the sound propagation model, which facilitates disentangled representations of the target speech source and noise signals. The experimental results demonstrate that the proposed method can extract the target speech source more accurately than conventional DNN-based speech source separation and conventional speech extraction using time-invariant spatial filtering.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Deep Speech Extraction with Time-Varying Spatial Filtering Guided By Desired Direction Attractor

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Method and practice of microphone array speech source localization based on sound propagation modeling
Gang Meng ... Yansong Wang
Applied Mathematics and Nonlinear Sciences | VOL. 9
Gang Meng, et. al.Gang Meng ... Yansong Wang
01 Jan 2024
Applied Mathematics and Nonlinear Sciences | VOL. 9

Optimization of wind farm operation with a noise constraint
Camilla Marie Nyborg ... Pierre-Elouan Réthoré
Wind Energy Science | VOL. 8
Camilla Marie Nyborg, et. al.Camilla Marie Nyborg ... Pierre-Elouan Réthoré
28 Feb 2023
Wind Energy Science | VOL. 8

Single-Microphone Speech Separation: The use of Speech Models
S. W.
-
S. W.S. W.
23 Jun 2011
23 Jun 2011

Binaural scene analysis : localization, detection and recognition of speakers in complex acoustic scenes

-

16 Apr 2013
16 Apr 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep Speech Extraction with Time-Varying Spatial Filtering Guided By Desired Direction Attractor

Abstract

Talk to us

Similar Papers