DEEP REINFORCEMENT LEARNING WITH HIDDEN MARKOV MODEL FOR SPEECH RECOGNITION

Samson Isaac,Rabi Mustapha,Muhammad Aminu Ahmad,Khalid Haruna

doi:10.26480/jtin.01.2023.01.05

Abstract

Nowadays, many applications uses speech recognition especially the field of computer science and electronics, Speech Recognition (SR) is the interpretation of words spoken into a text. It is also known as Speech-To-Text (STT) or Automatic-Speech-Recognition(ASR), or just Word-Recognition(WR). The Hidden-Markov-Model (HMM) is a type of Markov model, which means that the future state of the model depends on the current state, not on the entire history of the system and the goal of HMM is to learn a sequence of hidden states from a set of known states. The Long-Short-Time-Memory (LSTM) network is a type of Recurrent Neural Network (RNN) that can learn long-term dependencies between time steps of sequence data. The LSTM network is trained by the network in order to predict the values of subsequent time steps in a series-to-series regression. Deep Neural Network (DNN) models are better classifiers than Gaussian Mixture Models (GMMs), they can generalize much better with a smaller number of parameters over complex distributions. They model distributions of different classes jointly, called “distributed” learning, or, more properly “tied” learning. This work is aimed at developing a speech recognition model that will predict isolated speech of some selected fruits in Hausa, Igbo and Yoruba language by using the predicting power of Mel-Frequency-Cepstral-Coefficient (MFCC), LSTM and HMM algorithms. The findings of the study would improve the development of better automatic speech applications systems and would benefit the academic and research community in the field of Natural Language Processing.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

DEEP REINFORCEMENT LEARNING WITH HIDDEN MARKOV MODEL FOR SPEECH RECOGNITION

Abstract

Talk to us

Similar Papers

More From: JOURNAL OF TECHNOLOGY & INNOVATION

Lead the way for us

Similar Papers

Быстрый алгоритм распознавания голосовых команд на основе стационарного распределения скрытой марковской модели
Pavel A Paramonov ... Ivan V Ognev
Vestnik MEI | VOL. 5
Pavel A Paramonov, et. al.Pavel A Paramonov ... Ivan V Ognev
01 Jan 2018
Vestnik MEI | VOL. 5

On quantifying the quality of acoustic models in hybrid DNN-HMM ASR
Pranay Dighe ... Hervé Bourlard
Speech Communication | VOL. 119
Pranay Dighe, et. al.Pranay Dighe ... Hervé Bourlard
10 Mar 2020
Speech Communication | VOL. 119

Performance Analysis of various Front-end and Back End Amalgamations for Noise-robust DNN-based ASR
Mohit Dua ... Vinam Agrawal
Recent Advances in Computer Science and Communications | VOL. 14
Mohit Dua, et. al.Mohit Dua ... Vinam Agrawal
01 Dec 2021
Recent Advances in Computer Science and Communications | VOL. 14

Dialect Identification in Telugu Language Speech Utterance Using Modified Features with Deep Neural Network
Shivaprasad Satla ... Sadanandam Manchala
Traitement du Signal | VOL. 38
Shivaprasad Satla, et. al.Shivaprasad Satla ... Sadanandam Manchala
31 Dec 2021
Traitement du Signal | VOL. 38

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

DEEP REINFORCEMENT LEARNING WITH HIDDEN MARKOV MODEL FOR SPEECH RECOGNITION

Abstract

Talk to us

Similar Papers

More From: JOURNAL OF TECHNOLOGY &amp; INNOVATION

More From: JOURNAL OF TECHNOLOGY & INNOVATION