Abstract
Recent studies show that facial information contained in visual speech can enhance the performance of audio-only blind source separation (BSS) algorithms. This information is typically exploited by statistically characterizing the coherence between audio and visual speech using, for example, a Gaussian mixture model (GMM). In this paper, we present three contributions. First, using synchronized audio-visual features, we propose an adapted expectation–maximization (AEM) algorithm to model the audio–visual coherence during off-line training. Second, to improve the accuracy of this coherence model, we apply a frame-selection scheme that discards nonstationary features. Third, using a coherence-maximization technique, we develop a new sorting method that solves the permutation problem in the frequency domain. We test our algorithm on a multimodal speech database composed of different combinations of vowels and consonants. The experimental results show that the proposed algorithm outperforms traditional audio-only BSS, confirming the benefit of using visual speech to assist in separating the audio sources.
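To illustrate the general idea of resolving frequency-domain permutation ambiguity by maximizing an audio-visual coherence score, here is a minimal sketch. It is not the paper's method: the envelope-correlation score is used only as a stand-in for a trained audio-visual GMM likelihood, and the array shapes and function names are illustrative assumptions.

```python
# Sketch: permutation alignment in frequency-domain BSS by maximizing an
# audio-visual coherence score. The score below (envelope/visual-feature
# correlation) is a simple proxy for a GMM likelihood, not the paper's model.
import itertools
import numpy as np

def coherence_score(audio_env, visual_feat):
    """Normalized correlation between a separated source's log-magnitude
    envelope and a visual speech feature (e.g., a lip-opening trajectory)."""
    a = audio_env - audio_env.mean()
    v = visual_feat - visual_feat.mean()
    denom = np.linalg.norm(a) * np.linalg.norm(v) + 1e-12
    return float(a @ v) / denom

def align_permutations(sep_spectra, visual_feat):
    """sep_spectra: (n_bins, n_sources, n_frames) separated STFT magnitudes.
    visual_feat:  (n_frames,) visual feature of the target speaker.
    Returns the spectra reordered so that, in every frequency bin, the source
    placed in slot 0 is the one most coherent with the visual stream."""
    n_bins, n_sources, _ = sep_spectra.shape
    aligned = np.empty_like(sep_spectra)
    for k in range(n_bins):
        envelopes = np.log1p(np.abs(sep_spectra[k]))  # (n_sources, n_frames)
        best_perm, best_score = None, -np.inf
        for perm in itertools.permutations(range(n_sources)):
            # Score the candidate ordering by the coherence of the source
            # assigned to slot 0 with the visual feature sequence.
            score = coherence_score(envelopes[perm[0]], visual_feat)
            if score > best_score:
                best_perm, best_score = perm, score
        aligned[k] = sep_spectra[k, list(best_perm)]
    return aligned
```

In a complete system, the per-bin score would instead come from the coherence model learned off-line (e.g., the GMM trained with the adapted EM algorithm), but the bin-by-bin search over permutations shown here is the same.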