Abstract

Speech enhancement has been an intensive research for several decades to enhance the noisy speech that is corrupted by additive noise, multiplicative noise or convolutional noise. Even after decades of research it is still the most challenging problem, because most papers rely on estimating the noise during the nonspeech activity assuming that the background noise is uncorrelated (statistically independent of speech signal), nonstationary and slowly varying, so that the noise characteristics estimated in the absence of speech can be used subsequently in the presence of speech, whereas in a real time environment such assumptions do not hold for all the time. In this paper, we discuss the historical development of approaches that starts from the year 1970 to, the recent, 2013 for enhancing the noisy speech corrupted by additive background noise. Seeing the history, there are algorithms that enhance the noisy speech very well as long as a specific application is concerned such as the In-car noisy environments. It has to be observed that a speech enhancement algorithm performs well with a good estimation of the noise Power Spectral Density (PSD) from the noisy speech. Our idea pops up based on this observation, for online speech enhancement (i.e. in a real time environment) such as mobile phone applications, instead of estimating the noise from the noisy speech alone, the system should be able to monitor an environment continuously and classify it. Based on the current environment of the user, the system should adapt the algorithm (i.e. enhancement or estimation algorithm) for the current environment to enhance the noisy speech.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call