Abstract

Abstract This paper constructs a Chinese speech evaluation system based on the characteristics of the vocal mechanism of Chinese speech and uses the detection framework of statistical speech recognition to realize the automatic detection function of pronunciation bias. Assuming that the amplitude spectra of speech and noise obey the generalized Gamma distribution and Gaussian distribution, respectively, the MMSE estimator of the logarithmic spectrum of the speech signal is derived. The speech presence probability under the generalized Gamma speech model is derived as a correction to the MMSE estimation based on language evolution and diversity features. The correct recognition rate of the proposed algorithm reaches 64%, which indicates that the proposed algorithm can effectively suppress the interference of convolutional noise on the speech signal features and improve the subjective perceptual quality of diverse languages.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call