Markov random field in speech enhancement: Application for tonal languages

Tanawan Saimai,Chai Wutiwiwatchai,Chutamanee Onsuwan,Charturong Tantibundhit

doi:10.1121/1.4805893

Abstract

This paper proposed speech enhancement algorithm based on Markov random field (MRF) model for Thai, a tonal language. Firstly, a noisy speech signal is transformed using the short time Fourier transform (STFT). In so doing, noise is removed and speech is preserved, especially harmonics information as f0 patterns are relevant perceptual cues for lexical tones. The voice activity detector is used to classify each STFT time frame into voiced and unvoiced. Harmonics information is retrieved from each voiced time frame, where four neighborhoods of the analyzed STFT coefficients include its adjacent time frames (left, right) and nearest harmonics (top, bottom). For the unvoiced, four adjacent coefficients (left, right, top, and bottom) are used. A two-state MRF model is used to classify STFT coefficients into speech and noise. Those with speech state are retained, while the rest is set to zero. The enhanced speech is estimated by the inverse STFT. Results from quality evaluation test on four sets of Thai rhyming words corrupted by white noise at SNR levels of 0, 5, and 10 dB showed that the proposed algorithm significantly improved SNR of noisy speeches compared with spectral subtraction (1.3 dB on average) and Wiener filtering (1.9 dB on average).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Markov random field in speech enhancement: Application for tonal languages

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America

Lead the way for us

Similar Papers

Thai speech enhancement using Markov random field
T Saimai ... C Tantibundhit
-
T Saimai, et. al.T Saimai ... C Tantibundhit
01 May 2012
01 May 2012

Supervised single channel dual domains speech enhancement using sparse non-negative matrix factorization
Md Shohidul Islam ... Zhongfu Ye
Digital Signal Processing | VOL. 100
Md Shohidul Islam, et. al.Md Shohidul Islam ... Zhongfu Ye
19 Feb 2020
Digital Signal Processing | VOL. 100

Speech Denoising via Low-Rank and Sparse Matrix Decomposition
Jianjun Huang
ETRI Journal | VOL. 36
Jianjun HuangJianjun Huang
01 Feb 2014
ETRI Journal | VOL. 36

STFT-Domain Neural Speech Enhancement With Very Low Algorithmic Latency
Zhong-Qiu Wang ... Gordon Wichern
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 31
Zhong-Qiu Wang, et. al.Zhong-Qiu Wang ... Gordon Wichern
01 Jan 2023
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 31

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Markov random field in speech enhancement: Application for tonal languages

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America