Abstract

We present new results on single-channel speech separation and propose a new separation approach that improves the speech quality of signals separated from an observed mixture. The key idea is to derive a mixture estimator based on sinusoidal parameters. The proposed estimator finds, from vector quantization (VQ) codebooks pre-trained for each speaker, the codevectors of sinusoidal parameters that, when combined, best fit the observed mixed signal. The selected codevectors are then used to reconstruct the recovered signal for each speaker in the mixture. Compared to the log-max mixture estimator used in binary masks and to the Wiener filtering approach, the proposed method achieves acceptable perceptual speech quality with less cross-talk at different signal-to-signal ratios. Moreover, the method is independent of pitch estimates and reduces the computational complexity of the separation by replacing the high-dimensional short-time Fourier transform (STFT) feature vectors with compact sinusoidal feature vectors. We report separation results for the proposed method and compare them with other benchmark methods. The improvement of the proposed method over these benchmarks is confirmed by perceptual evaluation of speech quality (PESQ) as an objective measure and by a MUSHRA listening test as a subjective evaluation, for both speaker-dependent and gender-dependent scenarios.
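The core of the estimator described above is a joint search over two pre-trained codebooks for the pair of codevectors whose combination best fits the observed mixture. The following is a minimal sketch of that idea, not the paper's actual implementation: it assumes codevectors are magnitude-spectrum vectors, assumes simple additive mixing in place of the paper's sinusoidal mixture model, and uses an exhaustive least-squares search; all names and dimensions are illustrative.

```python
import numpy as np

def separate_mixture(mix, codebook_a, codebook_b):
    """Toy joint codebook search for single-channel separation.

    mix:         (D,) observed mixture feature vector
    codebook_a:  (Na, D) pre-trained codevectors for speaker A
    codebook_b:  (Nb, D) pre-trained codevectors for speaker B

    Returns the pair of codevectors whose sum best fits the mixture;
    these serve as the reconstructed signals for the two speakers.
    """
    # All pairwise combinations under the additive-mixing assumption:
    # shape (Na, Nb, D)
    sums = codebook_a[:, None, :] + codebook_b[None, :, :]
    # Squared fit error of each candidate pair against the mixture
    err = np.sum((sums - mix) ** 2, axis=-1)
    i, j = np.unravel_index(np.argmin(err), err.shape)
    return codebook_a[i], codebook_b[j]

# Toy example: 4-dimensional features, two small codebooks
rng = np.random.default_rng(0)
cb_a = rng.uniform(0.0, 1.0, size=(8, 4))
cb_b = rng.uniform(0.0, 1.0, size=(8, 4))
true_a, true_b = cb_a[3], cb_b[5]
mix = true_a + true_b  # mixture generated from one codevector per speaker

est_a, est_b = separate_mixture(mix, cb_a, cb_b)
```

Because the abstract replaces high-dimensional STFT vectors with low-dimensional sinusoidal parameters, the dimensionality D of each codevector (and hence the cost of this pairwise search) is kept small in the actual method.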
