Unsupervised Single-Channel Singing Voice Separation with Weighted Robust Principal Component Analysis Based on Gammatone Auditory Filterbank and Vocal Activity Detection

Feng Li,Yujun Hu,Lingling Wang

doi:10.3390/s23063015

Feng Li, Yujun Hu + Show 1 more

Open Access

https://doi.org/10.3390/s23063015

Copy DOI

Journal: Sensors	Publication Date: Mar 10, 2023
Citations: 1	License type: CC BY 4.0

Affiliation: Anhui University of Finance and Economics

Abstract

Singing-voice separation is a separation task that involves a singing voice and musical accompaniment. In this paper, we propose a novel, unsupervised methodology for extracting a singing voice from the background in a musical mixture. This method is a modification of robust principal component analysis (RPCA) that separates a singing voice by using weighting based on gammatone filterbank and vocal activity detection. Although RPCA is a helpful method for separating voices from the music mixture, it fails when one single value, such as drums, is much larger than others (e.g., the accompanying instruments). As a result, the proposed approach takes advantage of varying values between low-rank (background) and sparse matrices (singing voice). Additionally, we propose an expanded RPCA on the cochleagram by utilizing coalescent masking on the gammatone. Finally, we utilize vocal activity detection to enhance the separation outcomes by eliminating the lingering music signal. Evaluation results reveal that the proposed approach provides superior separation outcomes than RPCA on ccMixter and DSD100 datasets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Unsupervised Single-Channel Singing Voice Separation with Weighted Robust Principal Component Analysis Based on Gammatone Auditory Filterbank and Vocal Activity Detection

Abstract

Talk to us

Similar Papers

More From: Sensors

Lead the way for us

Similar Papers

Weighted Robust Principal Component Analysis with Gammatone Auditory Filterbank for Singing Voice Separation
Feng Li ... Masato Akagi
-
Feng Li, et. al.Feng Li ... Masato Akagi
01 Jan 2017
01 Jan 2017

Speech Denoising via Low-Rank and Sparse Matrix Decomposition
Jianjun Huang
ETRI Journal | VOL. 36
Jianjun HuangJianjun Huang
01 Feb 2014
ETRI Journal | VOL. 36

Unsupervised Singing Voice Separation Using Gammatone Auditory Filterbank and Constraint Robust Principal Component Analysis
Feng Li ... Masato Akagi
-
Feng Li, et. al.Feng Li ... Masato Akagi
01 Nov 2018
01 Nov 2018

Blind monaural singing voice separation using rank-1 constraint robust principal component analysis and vocal activity detection
Feng Li ... Masato Akagi
Neurocomputing | VOL. 350
Feng Li, et. al.Feng Li ... Masato Akagi
17 Apr 2019
Neurocomputing | VOL. 350

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Unsupervised Single-Channel Singing Voice Separation with Weighted Robust Principal Component Analysis Based on Gammatone Auditory Filterbank and Vocal Activity Detection

Abstract

Talk to us

Similar Papers

More From: Sensors