Abstract

This paper proposes an optimum speaker-independent, isolated word Hidden Markov Model (HMM) recognizer for the Hindi language. The recognition system is based on the combination of the vector quantization (VQ) technique at the acoustical level and the Markovian modeling at the recognition level. The recognizer consists of three modules – feature extraction, vector quantizer and HMM training and testing modules. The scheme proposed here firstly computes the acoustic features in terms of the Linear Predictive Cepstral LPC coefficients, Mel-Frequency Cepstral coefficients and delta MFCC along with noise and silence detection. Then, codebooks are created using VQ, and finally in the recognition phase, an optimum set of parameters are derived from different phases for getting the highest recognition score. The training and testing database consists of a set of 35 utterances of nine Indian cities/states and 35 utterances of nine digits spoken in Hindi by male and female speakers. The recognition rate was observed to be 98.61%.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call