Joint Amplitude and Phase Refinement for Monaural Source Separation

Yoshiki Masuyama,Kohei Yatabe,Yasuhiro Oikawa,Kento Nagatomo

doi:10.1109/lsp.2020.3031464

Yoshiki Masuyama, Kohei Yatabe + Show 2 more

Open Access

https://doi.org/10.1109/lsp.2020.3031464

Copy DOI

Journal: IEEE Signal Processing Letters	Publication Date: Jan 1, 2020
Citations: 39	License type: CC BY 4.0

Affiliation: Waseda University

Abstract

Monaural source separation is often conducted by manipulating the amplitude spectrogram of a mixture (e.g., via time-frequency masking and spectral subtraction). The obtained amplitudes are converted back to the time domain by using the phase of the mixture or by applying phase reconstruction. Although phase reconstruction performs well for the true amplitudes, its performance is degraded when the amplitudes contain error. To deal with this problem, we propose an optimization-based method to refine both amplitudes and phases based on the given amplitudes. It aims to find time-domain signals whose amplitude spectrograms are close to the given ones in terms of the generalized alpha-beta divergences. To solve the optimization problem, the alternating direction method of multipliers (ADMM) is utilized. We confirmed the effectiveness of the proposed method through speech-nonspeech separation in various conditions.

Highlights

M ONAURAL source separation (MSS) aims to decompose a single-channel mixture signal into each source signal
Griffin–Lim algorithm (GLA) modifies the phase of each separated signal based on the short-time Fourier transform (STFT) consistency: the reconstructed complex STFT coefficient should retain the neighborhood relation caused by the overlapped window of STFT [12]
The multiple input spectrogram inversion (MISI) [13] further considered the mixture consistency [17]: a sum of separated signals should coincide with the mixture

Summary

INTRODUCTION

M ONAURAL source separation (MSS) aims to decompose a single-channel mixture signal into each source signal. Due to the use of the mixed phase, the obtained signals contain interference even when the amplitudes are ideally separated. To tackle this problem, various phase reconstruction methods have been presented [10]–[16]. In MSS, the estimated amplitudes often contain error, which significantly impairs the performance of MISI This is because it keeps the given amplitudes and only attempts to reconstruct phases that are appropriate for the amplitudes in terms of STFT and mixture consistencies. The optimization problem aims to find the separated time-domain signals whose amplitude spectrograms are close to the given ones while considering the mixture consistency as a regularization. The effectiveness and robustness of the proposed method were confirmed by speech-nonspeech separation using various amplitude estimation methods

PRELIMINARIES

Problem Formulation

Relation to Existing Methods

Experimental Conditions

Experimental Results

CONCLUSION

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Joint Amplitude and Phase Refinement for Monaural Source Separation

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Signal Processing Letters

Lead the way for us

Similar Papers

Griffin–Lim Like Phase Recovery via Alternating Direction Method of Multipliers
Yoshiki Masuyama ... Yasuhiro Oikawa
IEEE Signal Processing Letters | VOL. 26
Yoshiki Masuyama, et. al.Yoshiki Masuyama ... Yasuhiro Oikawa
01 Jan 2019
IEEE Signal Processing Letters | VOL. 26

Solution of Large-scale Structured Optimization Problems with Schur-complement and Augmented Lagrangian Decomposition Methods

-

02 Aug 2019
02 Aug 2019

Consensus ADMM and Proximal ADMM for economic dispatch and AC OPF with SOCP relaxation
Minyue Ma ... Lingling Fan
-
Minyue Ma, et. al.Minyue Ma ... Lingling Fan
01 Sep 2016
01 Sep 2016

Second-Order Multiplier Updates to Accelerate Admm Methods in Optimization Under Uncertainty
Jose S Rodriguez ...
Computer Aided Chemical Engineering | VOL. 47
Jose S Rodriguez, et. al.Jose S Rodriguez ...
01 Jan 2019
Computer Aided Chemical Engineering | VOL. 47

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Joint Amplitude and Phase Refinement for Monaural Source Separation

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Signal Processing Letters