Unsupervised Speech Segregation Using Pitch Information and Time Frequency Masking

M.S Lekshmi, P.S Sathidevi

doi:10.1016/j.procs.2015.02.002

Open Access

Journal: Procedia Computer Science
Publication Date: Jan 1, 2015
Citations: 3
License type: cc-by-nc-nd

Affiliation: National Institute of Technology Calicut

#Perceptual Evaluation Of Speech Quality #Time Frequency Mask

Abstract

Speech is subject to various acoustic interferences in natural environments, and many applications require an effective way to separate the dominant signal from the interference. This paper proposes an unsupervised, Short-time Fourier Transform (STFT) based method for single-channel speech separation. It uses the pitch information of the dominant and interfering speakers to generate a time-frequency mask based on the pitch frequencies. Through rigorous objective and subjective evaluations, the proposed system is shown to provide better Signal to Noise Ratio (SNR) and Perceptual Evaluation of Speech Quality (PESQ) scores than other related methods available in the literature.
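
To make the idea of a pitch-driven time-frequency mask concrete, the following is a minimal sketch and not the authors' algorithm, whose details are not given in the abstract. It assumes per-frame pitch estimates for both speakers are already available and assigns each STFT bin to whichever speaker has the nearer harmonic. The function names, the nearest-harmonic rule, and the handling of unvoiced frames are all illustrative assumptions.

```python
def nearest_harmonic_distance(freq_hz, f0_hz):
    """Distance in Hz from freq_hz to the closest harmonic k * f0_hz, k >= 1."""
    k = max(1, round(freq_hz / f0_hz))
    return abs(freq_hz - k * f0_hz)


def pitch_based_mask(f0_target, f0_interferer, n_fft, sample_rate):
    """Binary time-frequency mask (hypothetical rule, for illustration only).

    f0_target, f0_interferer: per-frame pitch estimates in Hz (0 = unvoiced).
    Returns one list per frame holding n_fft // 2 + 1 values, where 1 marks
    a bin that lies closer to a target harmonic than to an interferer harmonic.
    """
    n_bins = n_fft // 2 + 1
    mask = []
    for f0_t, f0_i in zip(f0_target, f0_interferer):
        frame = []
        for k in range(n_bins):
            freq = k * sample_rate / n_fft  # centre frequency of bin k
            if f0_t <= 0:
                keep = 0  # assumption: no target pitch, keep nothing
            elif f0_i <= 0:
                keep = 1  # assumption: no interferer pitch, keep everything
            else:
                d_t = nearest_harmonic_distance(freq, f0_t)
                d_i = nearest_harmonic_distance(freq, f0_i)
                keep = 1 if d_t < d_i else 0
            frame.append(keep)
        mask.append(frame)
    return mask


# Example: 100 Hz bin spacing, target pitch 200 Hz, interferer pitch 300 Hz.
example_mask = pitch_based_mask([200.0], [300.0], n_fft=80, sample_rate=8000)
```

In a full system, a mask of this kind would be multiplied element-wise with the STFT of the mixture and the result inverted back to a waveform; the pitch tracking itself, which is the harder part, is outside the scope of this sketch.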
