Using an Adjustment Training and a Smoothing Mask for Speech Segregation

Yi Jiang,Run-Sheng Liu,Yuan-Yuan Zu

doi:10.12783/dtetr/icmm2017/20356

Using an Adjustment Training and a Smoothing Mask for Speech Segregation

Yi Jiang, Run-Sheng Liu + Show 1 more

Open Access

https://doi.org/10.12783/dtetr/icmm2017/20356

Copy DOI

Journal: DEStech Transactions on Engineering and Technology Research

Publication Date: Apr 3, 2018

#Computational Auditory Scene Analysis #Improvement Of Speech Intelligibility + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

This paper focuses on the improvement of speech intelligibility and nature auditory perception. A dual microphone computational auditory scene analysis (CASA) based speech segregation system is proposed. A deep neural network (DNN) is equipped to estimate the parameter mask, which is used to train a smoothing mask to segregate the target speech from the mixture. A mask smoothing method is proposed to reduce the musical noise, which is caused by estimation errors. The performance of the proposed method is systematic evaluated with the simulated and recording data. The tests show that the proposed method improves the signal to noise ratio (SNR), suppress the musical noise, and has good performance on untrained locations and reverberant test conditions too.

Full Text