Abstract
Over the last few decades, hidden Markov models (HMMs) have become a core technology in automatic speech recognition (ASR). The contemporary HMM approach uses Gaussian mixture models (GMMs) as acoustic models, which are capable of statistical inference of speech variability. Deep neural networks (DNNs) applied to ASR as acoustic models have outperformed GMMs in large-vocabulary speech recognition. However, conventional approaches to ASR are computationally very expensive, which makes it impossible to apply them in voice control systems on low-power devices. This paper focuses on an approach to isolated word recognition with reduced computational cost, which makes on-device recognition feasible on devices with limited computational resources. All components of the isolated word recognizer are described. Quantized Mel-frequency cepstral coefficients are used as speech features. A fast algorithm for isolated word recognition is described. It is based on the stationary distribution of a hidden Markov model and has linear computational complexity. Another important feature of the proposed approach is that it requires significantly less memory to store model parameters than HMM-GMM and DNN models. The performance of the algorithm is evaluated on the TIMIT isolated words dataset. The performance of the proposed method is compared with the results obtained with the conventional forward algorithm, the HMM-GMM approach, and a Self-Adjustable Neural Network. Only the HMM-GMM approach outperformed the proposed stationary distribution approach.
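The abstract does not spell out the scoring rule, so the following is only a minimal sketch of one plausible reading of "based on a stationary distribution": each word's discrete HMM is collapsed into a single probability table over quantized feature symbols, weighted by the stationary distribution of its transition matrix, and an utterance is scored in time linear in its length. The function names, the toy models, the power-iteration solver, and the assumption that each chain is ergodic (so that a unique stationary distribution exists) are illustrative assumptions, not details taken from the paper.

```python
import math

def stationary_distribution(A, iters=1000, tol=1e-12):
    """Approximate pi with pi = pi * A for a row-stochastic matrix A.

    Assumption: the chain is ergodic, so power iteration converges.
    """
    n = len(A)
    pi = [1.0 / n] * n
    for _ in range(iters):
        nxt = [sum(pi[i] * A[i][j] for i in range(n)) for j in range(n)]
        converged = max(abs(a - b) for a, b in zip(nxt, pi)) < tol
        pi = nxt
        if converged:
            break
    return pi

def symbol_marginals(A, B):
    """Collapse an HMM (A: transitions, B: per-state emission rows) into
    one probability per quantized symbol: m[k] = sum_i pi[i] * B[i][k]."""
    pi = stationary_distribution(A)
    return [sum(pi[i] * B[i][k] for i in range(len(pi)))
            for k in range(len(B[0]))]

def stationary_log_score(marginals, observations):
    """Score a symbol sequence in O(T); a small floor avoids log(0)."""
    return sum(math.log(max(marginals[o], 1e-12)) for o in observations)

def recognize(models, observations):
    """Return the word whose collapsed model gives the highest score."""
    return max(models, key=lambda w: stationary_log_score(models[w], observations))

# Hypothetical two-state, three-symbol toy models (not from the paper).
A_YES = [[0.9, 0.1], [0.2, 0.8]]
B_YES = [[0.7, 0.2, 0.1], [0.1, 0.3, 0.6]]
A_NO = [[0.5, 0.5], [0.5, 0.5]]
B_NO = [[0.1, 0.1, 0.8], [0.2, 0.6, 0.2]]

MODELS = {
    "yes": symbol_marginals(A_YES, B_YES),
    "no": symbol_marginals(A_NO, B_NO),
}

if __name__ == "__main__":
    print(recognize(MODELS, [0, 0, 1, 0]))
```

Under these assumptions, only one table of symbol probabilities per word has to be stored and each frame costs a single lookup, whereas the forward algorithm multiplies by the full transition matrix at every frame. This is consistent with the linear-complexity and low-memory claims above, but whether the paper uses exactly this rule is not confirmed by the abstract.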