Structured Sparsity Models for Reverberant Speech Separation

Afsaneh Asaei,Herve Bourlard,Volkan Cevher,Mohammad Golbabaee

doi:10.1109/taslp.2013.2297012

Abstract

We tackle the speech separation problem through modeling the acoustics of the reverberant chambers. Our approach exploits structured sparsity models to perform speech recovery and room acoustic modeling from recordings of concurrent unknown sources. The speakers are assumed to lie on a two-dimensional plane and the multipath channel is characterized using the image model. We propose an algorithm for room geometry estimation relying on localization of the early images of the speakers by sparse approximation of the spatial spectrum of the virtual sources in a free-space model. The images are then clustered exploiting the low-rank structure of the spectro-temporal components belonging to each source. This enables us to identify the early support of the room impulse response function and its unique map to the room geometry. To further tackle the ambiguity of the reflection ratios, we propose a novel formulation of the reverberation model and estimate the absorption coefficients through a convex optimization exploiting joint sparsity model formulated upon spatio-spectral sparsity of concurrent speech representation. The acoustic parameters are then incorporated for separating individual speech signals through either structured sparse recovery or inverse filtering the acoustic channels. The experiments conducted on real data recordings of spatially stationary sources demonstrate the effectiveness of the proposed approach for speech separation and recognition.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Structured Sparsity Models for Reverberant Speech Separation

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Audio, Speech, and Language Processing

Lead the way for us

Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing	Publication Date: Mar 1, 2014
Citations: 80

Similar Papers

Language Identification in Overlapped Multi-lingual Speeches
Zuhragvl Aysa ... Askar Hamdulla
-
Zuhragvl Aysa, et. al.Zuhragvl Aysa ... Askar Hamdulla
22 Jul 2022
22 Jul 2022

A Neural Network Based Regression Approach for Recognizing Simultaneous Speech
Weifeng Li ... Hervé Bourlard
-
Weifeng Li, et. al.Weifeng Li ... Hervé Bourlard
08 Sep 2008
08 Sep 2008

SVM-based separation of unvoiced-voiced speech in cochannel conditions
Ke Hu ... Deliang Wang
-
Ke Hu, et. al.Ke Hu ... Deliang Wang
01 Mar 2012
01 Mar 2012

An empirical investigation of sparse log-linear models for improved dialogue act classification
Yun-Nung Chen ... William Yang Wang
-
Yun-Nung Chen, et. al.Yun-Nung Chen ... William Yang Wang
01 May 2013
01 May 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Structured Sparsity Models for Reverberant Speech Separation

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Audio, Speech, and Language Processing