Online Convolutive Non-Negative Bases Learning for Speech Enhancement

Yinan Li, Xiongwei Zhang, Li Li, Yonggang Hu, Meng Sun

doi:10.1587/transfun.e99.a.1609

Abstract

An online version of convolutive non-negative sparse coding (CNSC) with the generalized Kullback-Leibler (K-L) divergence is proposed to adaptively learn spectral-temporal bases from speech streams. The proposed scheme processes training data piece by piece and incrementally updates the learned bases with accumulated statistics, which overcomes the inefficiency of its offline counterpart on large-scale or streaming data. Unlike conventional non-negative sparse coding, the scheme uses a convolutive model within the bases, so that each basis can describe a relatively long temporal span of the signal, which improves the representation power of the model. Moreover, by incorporating a voice activity detector (VAD), we propose an unsupervised enhancement algorithm that adaptively updates the noise dictionary from non-speech intervals. During speech intervals, the speech bases are learned adaptively while the noise bases are kept fixed. Experimental results show that the proposed algorithm substantially outperforms the competing algorithms, especially when the background noise is highly non-stationary.
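As an illustration of the convolutive model and the generalized K-L objective named in the abstract, the following is a minimal NumPy sketch, not the authors' implementation. It assumes a magnitude spectrogram `V` of shape (F, N), bases `W` of shape (T, F, K) spanning T frames each, and activations `H` of shape (K, N). All function names, the sparsity weight `lam`, and the smoothing constant `eps` are hypothetical. The activation update is the standard multiplicative rule for K-L divergence with fixed bases; the paper's online update of the bases from accumulated statistics is not reproduced here.

```python
import numpy as np


def shift_right(X, t):
    """Shift the columns of X right by t frames, zero-filling on the left."""
    if t == 0:
        return X.copy()
    out = np.zeros_like(X)
    out[:, t:] = X[:, :-t]
    return out


def shift_left(X, t):
    """Shift the columns of X left by t frames, zero-filling on the right."""
    if t == 0:
        return X.copy()
    out = np.zeros_like(X)
    out[:, :-t] = X[:, t:]
    return out


def conv_reconstruct(W, H):
    """Convolutive model: V_hat = sum_t W[t] @ shift_right(H, t)."""
    return sum(W[t] @ shift_right(H, t) for t in range(W.shape[0]))


def generalized_kl(V, V_hat, eps=1e-12):
    """Generalized K-L divergence D(V || V_hat), summed over all entries."""
    return float(np.sum(V * np.log((V + eps) / (V_hat + eps)) - V + V_hat))


def update_activations(V, W, H, lam=0.0, eps=1e-12):
    """One multiplicative update of H with the bases W held fixed.

    lam is an assumed L1 sparsity weight on H; lam=0 gives plain K-L fitting.
    """
    ratio = V / (conv_reconstruct(W, H) + eps)
    ones = np.ones_like(V)
    num = sum(W[t].T @ shift_left(ratio, t) for t in range(W.shape[0]))
    den = sum(W[t].T @ shift_left(ones, t) for t in range(W.shape[0]))
    return H * num / (den + lam + eps)


# Toy usage: fit activations to a synthetic spectrogram with known bases.
rng = np.random.default_rng(0)
W = rng.random((3, 6, 2)) + 0.1      # T=3 frames, F=6 bins, K=2 bases
H_true = rng.random((2, 20))         # K=2 activations over N=20 frames
V = conv_reconstruct(W, H_true)

H = rng.random((2, 20)) + 0.1        # random non-negative initialization
d_before = generalized_kl(V, conv_reconstruct(W, H))
for _ in range(20):
    H = update_activations(V, W, H)
d_after = generalized_kl(V, conv_reconstruct(W, H))
```

Holding `W` fixed mirrors the situation the abstract describes for a fixed dictionary; an online variant would additionally refresh `W` chunk by chunk from running statistics.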


Journal: IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: Jan 1, 2016
Citations: 1
License type: free
