Abstract

The redundant convolutional encoder-decoder network has proven useful in speech enhancement tasks. It can capture localized time-frequency details of speech signals through both its fully convolutional structure and the feature selection capability of the encoder-decoder mechanism. However, it does not explicitly consider the signal filtering mechanism, which we regard as important for speech enhancement models. In this study, we introduce an attention mechanism into the convolutional encoder-decoder model. This mechanism adaptively filters channel-wise feature responses by explicitly modeling attention (on speech versus noise signals) between channels. Experimental results show that the proposed attention model is effective in capturing speech signals from background noise, and performs especially well in unseen noise conditions compared to other state-of-the-art models.
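The abstract describes an attention mechanism that adaptively re-weights channel-wise feature responses inside a convolutional encoder-decoder. The paper's exact formulation is not given here, so the following is only a minimal sketch of one common realization of channel attention (squeeze-and-excitation style gating): global pooling over the time-frequency plane, a small bottleneck MLP, and a sigmoid gate that scales each channel. All names, shapes, and the reduction ratio `r` are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def channel_attention(features, w1, w2):
    """Hypothetical channel-wise attention gate (squeeze-and-excitation style).

    features: (C, T, F) feature map from a convolutional layer
    w1:       (C, C // r) bottleneck weights (r = reduction ratio, assumed)
    w2:       (C // r, C) expansion weights
    """
    # Squeeze: global average pooling over the time-frequency plane
    z = features.mean(axis=(1, 2))                 # shape (C,)
    # Excitation: bottleneck MLP with ReLU, then a sigmoid gate in (0, 1)
    s = sigmoid(np.maximum(z @ w1, 0.0) @ w2)      # shape (C,)
    # Scale each channel's response by its attention weight
    return features * s[:, None, None]

# Usage with random weights (illustration only, no trained parameters)
rng = np.random.default_rng(0)
C, T, F, r = 8, 10, 16, 2
x = rng.standard_normal((C, T, F))
w1 = rng.standard_normal((C, C // r))
w2 = rng.standard_normal((C // r, C))
y = channel_attention(x, w1, w2)
```

Because the gate values lie strictly in (0, 1), each channel's response is attenuated rather than amplified, which matches the abstract's framing of attention as a filtering operation over channels.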
