Abstract

We propose a new feature extraction algorithm that is robust against noise. Nonlinear ltering and temporal masking are used for the proposed algorithm. Since the current automatic speech recognition systems use invariant- integration and delta-delta techniques for speech feature extraction, the proposed algorithm improves speech recognition accuracy appropriately using a delta-spectral feature instead of invariant integration. One of the nonenvironmental factors that reduce recognition accuracy is the vocal tract length (VTL), leading to a mismatch between the training and testing data. We can use the invariant-integration feature idea for decreasing the VTL eects. The aim of this paper is to provide robust features that provide improvements in dierent noise conditions as well as being robust against VTL eect changes. This results in more improvement of the recognition accuracy in comparison with mel-frequency cepstral coecients and perceptual linear prediction in the presence of dierent types of noises and scenarios.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call