Supervised and unsupervised separation of convolutive speech mixtures using f 0 and formant frequencies

M K Prasanna Kumar,R Kumaraswamy

doi:10.1007/s10772-015-9309-1

Abstract

In this paper we discuss the role of fundamental frequency f0 and formants F1, F2 and F3 of the speech signal in supervised and unsupervised source separation of real recorded convolutive speech mixtures. Initially supervised source separation is discussed where it is assumed that sources are known a priori. The supervised source separation is discussed by considering (1) only fundamental frequency f0, (2) only formants F1, F2 and F3, (3) both f0 and formants F1, F2 and F3. It is observed that last case which involves both f0 and formants gives most accurate separation results and is used as ideal case or reference to compare the separation results obtained for unsupervised source separation. The unsupervised source separation is discussed, where there is no knowledge about the sources a priori. The unsupervised source separation is discussed using (1) cross correlation of formants of different frames along with f0 and (2) standard deviation of magnitude of frequency components in F1, F2 and F3 regions of the spectrogram. It is observed that separation results obtained using both unsupervised methods are very close to the ideal case in supervised source separation. The results show that this method works better than some of the classical blind source separation algorithms like independent component analysis and non negative matrix factorization which works well only for the case of instantaneous mixtures where delay is neglected.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Supervised and unsupervised separation of convolutive speech mixtures using f 0 and formant frequencies

Abstract

Talk to us

Similar Papers

More From: International Journal of Speech Technology

Lead the way for us

Journal: International Journal of Speech Technology	Publication Date: Oct 13, 2015
Citations: 5

Similar Papers

Role of f0 and formant frequencies in unsupervised separation of convolutive speech mixtures
M.K Prasanna Kumar ... R Kumaraswamy
-
M.K Prasanna Kumar, et. al.M.K Prasanna Kumar ... R Kumaraswamy
01 Oct 2015
01 Oct 2015

Acoustic characteristics of the metallic voice quality.
Congeta Bruniere Xavier Fadel ... Rosane Sampaio Santos
CoDAS | VOL. 27
Congeta Bruniere Xavier Fadel, et. al.Congeta Bruniere Xavier Fadel ... Rosane Sampaio Santos
01 Feb 2015
CoDAS | VOL. 27

Frequency-Domain Blind Separation of Convolutive Speech Mixtures with Energy Correlation-Based Permutation Correction
Li-Dan Wang ... Qiu-Hua Lin
-
Li-Dan Wang, et. al.Li-Dan Wang ... Qiu-Hua Lin
01 Jan 2009
01 Jan 2009

A multistage approach to blind separation of convolutive speech mixtures
Tariqullah Jan ... Deliang Wang
Speech Communication | VOL. 53
Tariqullah Jan, et. al.Tariqullah Jan ... Deliang Wang
06 Jan 2011
Speech Communication | VOL. 53

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Supervised and unsupervised separation of convolutive speech mixtures using f 0 and formant frequencies

Abstract

Talk to us

Similar Papers

More From: International Journal of Speech Technology