Solving permutations in frequency-domain for blind separation of an arbitrary number of speech sources

Iván Durán-Díaz, Pablo Aguilera, Auxiliadora Sarmiento, Sergio Cruces

doi:10.1121/1.3678657

Open Access

Journal: The Journal of the Acoustical Society of America
Publication Date: Jan 23, 2012
Citations: 4
License type: cc-by-nc-nd

Affiliation: Universidad de Sevilla

Abstract

Blind separation of speech sources in reverberant environments is usually performed in the time-frequency domain, which gives rise to the permutation problem: the estimated sources may be ordered differently at different frequency components. A two-stage method to solve permutations with an arbitrary number of sources is proposed. The suggested procedure is based on the spectral consistency of the sources. In the first stage, frequency bins are compared with each other; in the second stage, neighboring frequencies are emphasized. Experiments under perfect-separation conditions and with live recordings show that the proposed method improves on the results of existing approaches.
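
To make the two-stage idea concrete, the following is a minimal sketch of a generic permutation-alignment scheme and not the authors' algorithm. It assumes that spectral consistency can be measured by correlating amplitude envelopes across frequency bins, which the abstract does not specify. The names `align_permutations`, `n_iter`, and `half_width` are hypothetical, as is the choice of bin 0 as the initial reference.

```python
from itertools import permutations

import numpy as np


def _normalize(env):
    """Make each envelope zero-mean and unit-norm along the time axis."""
    env = env - env.mean(axis=-1, keepdims=True)
    norm = np.linalg.norm(env, axis=-1, keepdims=True)
    return env / np.maximum(norm, 1e-12)


def _best_permutation(bin_env, ref_env):
    """Exhaustively pick the ordering of bin_env rows that best matches ref_env."""
    corr = ref_env @ bin_env.T  # corr[i, j]: reference i vs. component j
    n = corr.shape[0]
    return max(permutations(range(n)),
               key=lambda p: sum(corr[i, p[i]] for i in range(n)))


def align_permutations(env, n_iter=5, half_width=2):
    """env: (F, N, T) amplitude envelopes of N separated components in F bins.

    Returns an (F, N) integer array perm such that env[f, perm[f]] has a
    consistent source ordering across bins (under the stated assumption).
    """
    env = np.asarray(env, dtype=float)
    F, N, _ = env.shape
    z = _normalize(env)
    perm = np.tile(np.arange(N), (F, 1))

    # Stage 1 (global): compare every bin with references built from all bins.
    # Assumption: bin 0 serves as the initial reference.
    centroids = z[0]
    for _ in range(n_iter):
        perm = np.array([_best_permutation(z[f], centroids) for f in range(F)])
        aligned = z[np.arange(F)[:, None], perm]
        centroids = _normalize(aligned.mean(axis=0))

    # Stage 2 (local): re-check each bin against its neighboring bins only.
    for f in range(F):
        lo, hi = max(0, f - half_width), min(F, f + half_width + 1)
        neighbors = [g for g in range(lo, hi) if g != f]
        if not neighbors:
            continue
        local_ref = _normalize(np.mean([z[g, perm[g]] for g in neighbors], axis=0))
        perm[f] = _best_permutation(z[f], local_ref)
    return perm
```

The exhaustive search over orderings grows factorially with the number of sources, so it is only practical for a handful of them; for larger N, an assignment solver such as SciPy's `linear_sum_assignment` could replace `_best_permutation`.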

Full Text