Speech Recognition System based on Discrete Wave Atoms Transform Partial Noisy Environment

Mohamed Walid, Cherif Adnen, Bousselmi Souha

doi:10.14569/ijacsa.2019.0100560

Abstract

Automatic speech recognition is one of the most active research areas because it offers a dynamic platform for human-machine interaction. The robustness of speech recognition systems often degrades in real-time applications, which are typically accompanied by environmental noise. In this work, we investigate the efficiency of combining the wave atoms transform (WAT) with Mel-Frequency Cepstral Coefficients (MFCC), using a Support Vector Machine (SVM) as the classifier, under different noisy conditions. A full experimental evaluation of the proposed model was conducted using the Arabic speech database ARADIGIT, corrupted with noises from the “NOISEUS database” at SNR levels ranging from -5 to 15 dB. Simulation results indicate that the proposed algorithm improves the recognition rate, reaching 99.9% at an SNR of 15 dB. A comparative study was also conducted by applying the proposed WAT-MFCC features to a multilayer perceptron (MLP) and a hidden Markov model (HMM) in order to demonstrate the efficiency and robustness of the proposed system.
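The corruption of clean speech with noise at a chosen SNR, as described in the abstract, can be sketched as follows. This is a minimal illustration and not the authors' code: the function names (`signal_power`, `mix_at_snr`, `measured_snr_db`) are hypothetical, the 5 dB step between -5 and 15 dB is an assumption, and a sine wave and Gaussian noise stand in for the ARADIGIT and NOISEUS recordings.

```python
import math
import random


def signal_power(x):
    """Mean squared amplitude of a sequence of samples."""
    return sum(v * v for v in x) / len(x)


def mix_at_snr(speech, noise, snr_db):
    """Scale `noise` so the speech-to-noise power ratio equals `snr_db`,
    then add it to `speech`. Returns (noisy_signal, scaled_noise)."""
    if len(noise) < len(speech):
        raise ValueError("noise must be at least as long as speech")
    noise = noise[:len(speech)]
    p_speech = signal_power(speech)
    p_noise = signal_power(noise)
    # SNR_dB = 10 * log10(P_speech / P_noise_scaled), solved for the gain.
    gain = math.sqrt(p_speech / (p_noise * 10 ** (snr_db / 10)))
    scaled = [gain * n for n in noise]
    noisy = [s + n for s, n in zip(speech, scaled)]
    return noisy, scaled


def measured_snr_db(speech, noise):
    """SNR in dB between a clean signal and an additive noise signal."""
    return 10 * math.log10(signal_power(speech) / signal_power(noise))


# Stand-in signals (assumptions): 1 s of a 440 Hz tone sampled at 8 kHz
# and white Gaussian noise, instead of real speech and recorded noise.
rng = random.Random(0)
speech = [math.sin(2 * math.pi * 440 * t / 8000) for t in range(8000)]
noise = [rng.gauss(0.0, 1.0) for _ in range(8000)]

# Assumed SNR grid; the source only states the range -5 to 15 dB.
for snr in (-5, 0, 5, 10, 15):
    noisy, scaled = mix_at_snr(speech, noise, snr)
    print(snr, round(measured_snr_db(speech, scaled), 6))
```

Scaling the noise rather than the speech keeps the clean signal level fixed across conditions, so any change in recognition rate can be attributed to the noise level alone.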

Highlights

Automatic speech recognition allows the machine to understand and process information provided orally by a human user
A new speech recognition system based on wave atoms transform (WAT)-Mel-Frequency Cepstral Coefficients (MFCC) and Support Vector Machine (SVM) was developed in this paper to improve the accuracy of recognition
Although the worst performance was obtained with the multilayer perceptron (MLP) using MFCC soft, with a rate of 84.2%, the use of WAT-MFCC registered an acceptable accuracy of 92.4%

Summary

INTRODUCTION

Automatic speech recognition allows the machine to understand and process information provided orally by a human user. By simplifying the human-machine dialogue protocol, automatic speech processing aims to improve productivity, since it is the machine that adapts to humans in order to communicate, not the other way around. It also frees the eyes or hands for another task at the same time. Good speech recognition rates have mostly been reached using small vocabularies. This result is considered sufficient for implementing most voice control devices. The adopted approach has been tested on an Arabic language database in both clean and noisy conditions. This manuscript is structured as follows: in Section II, a brief literature review of ASR systems is presented.

RELATED WORKS

THE PROPOSED SPEECH RECOGNITION SYSTEM

Feature Extraction Stage

Speech Database

Experimental Results

CONCLUSION

Journal: International Journal of Advanced Computer Science and Applications	Publication Date: Jan 1, 2019
Citations: 3	License type: cc-by
