Eigennoise Speech Recovery in Adverse Environments with Joint Compensation of Additive and Convolutive Noise

Trung-Nghia Phung,Quang-Vinh Thai,Huy-Khoi Do,Vinh Dinh Nguyen

doi:10.1155/2015/170183

Trung-Nghia Phung, Quang-Vinh Thai + Show 2 more

Open Access

https://doi.org/10.1155/2015/170183

Copy DOI

Journal: Advances in Acoustics and Vibration	Publication Date: Nov 3, 2015
Citations: 11	License type: CC BY 3.0

Affiliation: Thai Nguyen University

Abstract

The learning-based speech recovery approach using statistical spectral conversion has been used for some kind of distorted speech as alaryngeal speech and body-conducted speech (or bone-conducted speech). This approach attempts to recover clean speech (undistorted speech) from noisy speech (distorted speech) by converting the statistical models of noisy speech into that of clean speech without the prior knowledge on characteristics and distributions of noise source. Presently, this approach has still not attracted many researchers to apply in general noisy speech enhancement because of some major problems: those are the difficulties of noise adaptation and the lack of noise robust synthesizable features in different noisy environments. In this paper, we adopted the methods of state-of-the-art voice conversions and speaker adaptation in speech recognition to the proposed speech recovery approach applied in different kinds of noisy environment, especially in adverse environments with joint compensation of additive and convolutive noises. We proposed to use the decorrelated wavelet packet coefficients as a low-dimensional robust synthesizable feature under noisy environments. We also proposed a noise adaptation for speech recovery with the eigennoise similar to the eigenvoice in voice conversion. The experimental results showed that the proposed approach highly outperformed traditional nonlearning-based approaches.

Full Text