Real-time Speech Recognition System Research Articles

This paper investigates the development of a real-time automatic speech recognition system dedicated to the Azerbaijani language, focusing on addressing the prevalent gap in speech recognition system for underrepresented languages. Our research integrates a hybrid acoustic modeling approach that combines Hidden Markov Model and Deep Neural Network to interpret the complexities of Azerbaijani acoustic patterns effectively. Recognizing the agglutinative nature of Azerbaijani, the ASR system employs a syllable-based n-gram model for language modeling, ensuring the system accurately captures the syntax and semantics of Azerbaijani speech. To enable real-time capabilities, we incorporate WebSocket technology, which facilitates efficient bidirectional communication between the client and server, necessary for processing streaming speech data instantly. The Kaldi and SRILM toolkits are used for the training of acoustic and language models, respectively, contributing to the system's robust performance and adaptability. We have conducted comprehensive experiments to test the effectiveness of our system, the results of which strongly corroborate the utility of the syllable-based subword modeling approach for Azerbaijani language recognition. Our proposed ASR system shows superior performance in terms of recognition accuracy and rapid response times, outperforming other systems tested on the same language data. The system's success not only proves beneficial for Azerbaijani language recognition but also provides a valuable framework for potential future applications in other agglutinative languages, thereby contributing to the promotion of linguistic diversity in automatic speech recognition technology.

This article uses Field Programmable Gate Array (FPGA) as a carrier and uses IP core to form a System on Programmable Chip (SOPC) English speech recognition system. The SOPC system uses a modular hardware system design method. Except for the independent development of the hardware acceleration module and its control module, the other modules are implemented by software or IP provided by Xilinx development tools. Hardware acceleration IP adopts a top-down design method, provides parallel operation of multiple operation components, and uses pipeline technology, which speeds up data operation, so that only one operation cycle is required to obtain an operation result. In terms of recognition algorithm, a more effective training algorithm is proposed, Genetic Continuous Hidden Markov Model (GA_CHMM), which uses genetic algorithm to directly train CHMM model. It is to find the optimal model by encoding the parameter values of the CHMM and performing operations such as selection, crossover, and mutation according to the fitness function. The optimal parameter value after decoding corresponds to the CHMM model, and then the English speech recognition is performed through the CHMM algorithm. This algorithm can save a lot of training time, thereby improving the recognition rate and speed. This paper studies the optimization of embedded system software. By studying the fixed-point software algorithm and the optimization of system storage space, the real-time response speed of the system has been reduced from about 10 seconds to an average of 220 milliseconds. Through the optimization of the CHMM algorithm, the real-time performance of the system is improved again, and the average time to complete the recognition is significantly shortened. At the same time, the system can achieve a recognition rate of over 90% when the English speech vocabulary is less than 200.

Real-time Speech Recognition System Research Articles

Articles published on Real-time Speech Recognition System

Real-time multilingual speech recognition and speaker diarization system based on Whisper segmentation.

Real-Time Automatic Continuous Speech Recognition System for Kannada Language/Dialects

DEVELOPMENT OF A REAL-TIME SPEECH RECOGNITION SYSTEM FOR THE AZERBAIJANI LANGUAGE

Voice Assistant Notepad

Design and Implementation of Energy-Efficient Floating Point MFCC Extraction Architecture for Speech Recognition Systems

Design and Implementation of Embedded Real-Time English Speech Recognition System Based on Big Data Analysis

Practical Study and Implementation of an Isolated Word Speech Recognition System.(Dept.E)

RETRACTED: Real time speech recognition algorithm on embedded system based on continuous Markov model

Real-time transcription, keyword spotting, archival and retrieval for telugu TV news using ASR

A Robust Feature Extraction Method for Real-Time Speech Recognition System on a Raspberry Pi 3 Board

, ,

Question answering in conversations: Query refinement using contextual and semantic information

Energy-Efficient Floating-Point MFCC Extraction Architecture for Speech Recognition Systems

FPGA-Based Hardware Accelerator for Feature Extraction in Automatic Speech Recognition

Real-Time Arabic Speech Recognition

Robust Automatic Speech recognition System Implemented in a Hybrid Design DSP-FPGA

Optimized Acoustic Likelihoods Computation for NVIDIA and ATI/AMD Graphics Processors

Design of a real time automatic speech recognition system using Modified One Against All SVM classifier

Online Speech/Music Segmentation Based on the Variance Mean of Filter Bank Energy

Mutual Information Based Dynamic Integration of Multiple Feature Streams for Robust Real-Time LVCSR

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Real-time Speech Recognition System Research Articles

Articles published on Real-time Speech Recognition System

Real-time multilingual speech recognition and speaker diarization system based on Whisper segmentation.

Real-Time Automatic Continuous Speech Recognition System for Kannada Language/Dialects

DEVELOPMENT OF A REAL-TIME SPEECH RECOGNITION SYSTEM FOR THE AZERBAIJANI LANGUAGE

Voice Assistant Notepad

Design and Implementation of Energy-Efficient Floating Point MFCC Extraction Architecture for Speech Recognition Systems

Design and Implementation of Embedded Real-Time English Speech Recognition System Based on Big Data Analysis

Practical Study and Implementation of an Isolated Word Speech Recognition System.(Dept.E)

RETRACTED: Real time speech recognition algorithm on embedded system based on continuous Markov model

Real-time transcription, keyword spotting, archival and retrieval for telugu TV news using ASR

A Robust Feature Extraction Method for Real-Time Speech Recognition System on a Raspberry Pi 3 Board

, ,

Question answering in conversations: Query refinement using contextual and semantic information

Energy-Efficient Floating-Point MFCC Extraction Architecture for Speech Recognition Systems

FPGA-Based Hardware Accelerator for Feature Extraction in Automatic Speech Recognition

Real-Time Arabic Speech Recognition

Robust Automatic Speech recognition System Implemented in a Hybrid Design DSP-FPGA

Optimized Acoustic Likelihoods Computation for NVIDIA and ATI/AMD Graphics Processors

Design of a real time automatic speech recognition system using Modified One Against All SVM classifier

Online Speech/Music Segmentation Based on the Variance Mean of Filter Bank Energy

Mutual Information Based Dynamic Integration of Multiple Feature Streams for Robust Real-Time LVCSR