Voice Recognition System Research Articles

Voice recognition systems have gained significant prevalence in our everyday lives, encompassing a wide range of applications, from virtual assistants on smartphones to voice-controlled home automation systems. This research paper presents a comprehensive design and implementation of a voice recognition security system employing artificial neural networks. The system's training involved a dataset consisting of 900 audio samples collected from 10 distinct speakers, enabling the resulting model to accurately classify the speaker of a given audio sample. For the implementation of the voice recognition system, Python serves as the primary programming language. The system leverages the Keras library, which offers a high-level interface for constructing and training neural networks, with efficient computation facilitated by the TensorFlow back-end. Additionally, the Flask framework, a Python-based web framework, was utilized to create a user interface in the form of a web application for the voice recognition system. To effectively train the artificial neural network, the audio data undergoes preprocessing, involving the extraction of relevant features from the audio samples. Subsequently, during the preprocessing phase, the audio data is labelled, and the neural network is trained on this labelled dataset to learn the classification of different speakers. The trained model was rigorously tested on a set of previously unseen audio samples, yielding an impressive classification accuracy exceeding 96%. The finalized model will be integrated into the web application, enabling users to upload audio files and receive accurate predictions regarding the speaker's identity. This paper demonstrates the efficacy of artificial neural networks in the context of voice recognition systems, while also providing a practical framework for constructing such systems using readily available tools and libraries.

This study examines the effectiveness of neural network architectures (multilayer perceptron MLP, convolutional neural network CNN, recurrent neural network RNN) for human voice recognition, with an emphasis on the Kazakh language. Problems related to language, the difference between speakers, and the influence of network architecture on recognition accuracy are considered. The methodology includes extensive training and testing, studying the accuracy of recognition in different languages, and different sets of data on speakers. Using a comparative analysis, this study evaluates the performance of three architectures trained exclusively in the Kazakh language. The testing included statements in Kazakhs and other languages, while the number of speakers varied to assess its impact on recognition accuracy. During the study, the results showed that CNN neural networks are more effective in recognizing human voice than RNN and MLP. Also, it was found that the CNN has a higher accuracy in recognizing the human voice in the Kazakh language, both for a small and for a large number of announcers. For example, for 20 speakers, the recognition error in Russian was 21.86 %, whereas in Kazakhs it was 10.6 %. A similar trend was observed for 80 speakers: 16.2 % Russians and 8.3 % Kazakhs. It can also be argued that learning one language does not guarantee high recognition accuracy in other languages. Therefore, the accuracy of human voice recognition by neural networks depends significantly on the language in which training is conducted. In addition, this study highlights the importance of different sets of speaker data to achieve optimal results. This knowledge is crucial for advancing the development of reliable human voice recognition systems that can accurately identify different human voices in different language contexts

Voice Recognition System Research Articles

Related Topics

Articles published on Voice Recognition System

Biometric voice recognition system in the context of multiple languages: using traditional means of identification of individuals in Nigeria languages and English language

Biometric voice recognition system in the context of multiple languages: using traditional means of identification of individuals in Nigeria languages and English language

Jarvis-Virtual Voice Assistant

Development of an Automatic Baby Cradle System

AROA based Pre-trained Model of Convolutional Neural Network for Voice Pathology Detection and Classification

Low Cost Motorized Patient BED with Voice Control

Design and Development of Smart Blind Stick for Visually Impaired People

Personalized user authentication system using wireless EEG headset and machine learning

Design of a Voice Recognition System Using Artificial Neural Network

The dependence of the effectiveness of neural networks for recognizing human voice on language

Integration of Deep Learning and Collaborative Robot for Assembly Tasks

Perancangan Sistem Deteksi Dan Pengenalan Perintah Suara Menggunakan Modul Esp 32 Dengan Metode Convolutional Neural Network (Cnn)

XGBoost and Convolutional Neural Network Classification Models on Pronunciation of Hijaiyah Letters According to Sanad

A Robust Voice Pathology Detection System Based on the Combined BiLSTM–CNN Architecture

Improving self-supervised learning model for audio spoofing detection with layer-conditioned embedding fusion

Dome-Shaped Ultrasonic Jammer for speech privacy protection

Speech Emotion Recognition (SER) dengan Metode Bidirectional LSTM

An Overview of Speech-To-Text Conversion

ADVANCING GASIFICATION-COMBINED UP AND DOWN DRAFT GASIFIER-BASED TREATMENT OF TEXTILE WASTE: ASSESSING FEASIBILITY, ENVIRONMENTAL IMPACTS AND ENERGY RECOVERY POTENTIAL

Dynamic visualization BIM and voice recognition system in the optimization application of building construction structure

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Voice Recognition System Research Articles

Related Topics

Articles published on Voice Recognition System

Biometric voice recognition system in the context of multiple languages: using traditional means of identification of individuals in Nigeria languages and English language

Biometric voice recognition system in the context of multiple languages: using traditional means of identification of individuals in Nigeria languages and English language

Jarvis-Virtual Voice Assistant

Development of an Automatic Baby Cradle System

AROA based Pre-trained Model of Convolutional Neural Network for Voice Pathology Detection and Classification

Low Cost Motorized Patient BED with Voice Control

Design and Development of Smart Blind Stick for Visually Impaired People

Personalized user authentication system using wireless EEG headset and machine learning

Design of a Voice Recognition System Using Artificial Neural Network

The dependence of the effectiveness of neural networks for recognizing human voice on language

Integration of Deep Learning and Collaborative Robot for Assembly Tasks

Perancangan Sistem Deteksi Dan Pengenalan Perintah Suara Menggunakan Modul Esp 32 Dengan Metode Convolutional Neural Network (Cnn)

XGBoost and Convolutional Neural Network Classification Models on Pronunciation of Hijaiyah Letters According to Sanad

A Robust Voice Pathology Detection System Based on the Combined BiLSTM–CNN Architecture

Improving self-supervised learning model for audio spoofing detection with layer-conditioned embedding fusion

Dome-Shaped Ultrasonic Jammer for speech privacy protection

Speech Emotion Recognition (SER) dengan Metode Bidirectional LSTM

An Overview of Speech-To-Text Conversion

ADVANCING GASIFICATION-COMBINED UP AND DOWN DRAFT GASIFIER-BASED TREATMENT OF TEXTILE WASTE: ASSESSING FEASIBILITY, ENVIRONMENTAL IMPACTS AND ENERGY RECOVERY POTENTIAL

Dynamic visualization BIM and voice recognition system in the optimization application of building construction structure