Digital Processing Of Speech Signals Research Articles

Speech recognition refers to the capability of software or hardware to receive a speech signal, identify the speaker's features in that signal, and subsequently recognize the speaker. In general, the speech recognition process involves three main steps: acoustic processing, feature extraction, and classification/recognition. The purpose of feature extraction is to represent a speech signal using a predetermined number of signal components, because the full information in the acoustic signal is too cumbersome to handle and some of it is irrelevant to the identification task. This study proposes a machine learning-based approach that extracts feature parameters from speech signals to improve the performance of speech recognition applications in real-time smart city environments. Moreover, the principle of mapping a block of main memory to the cache is used to reduce computing time; the cache block size is a parameter that strongly affects cache performance. Processing speed plays an important role in real-time speech recognition: implementing these processes in real-time systems requires high computation speed, which in turn calls for modern technologies and fast algorithms that accelerate the extraction of feature parameters from speech signals. Problems with overclocking during the digital processing of speech signals have yet to be completely resolved. The experimental results demonstrate that the proposed method successfully extracts the signal features and achieves seamless classification performance compared with conventional speech recognition algorithms.
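As a rough illustration of representing a speech signal with a predetermined number of components, the sketch below splits a signal into overlapping frames and reduces each frame to a fixed number of log band energies. This is not the method proposed in the study, whose details are not given in the abstract. The function name `frame_features`, the 25 ms frame, the 10 ms hop, and the 12 equal-width bands are all assumptions chosen for illustration.

```python
import numpy as np

def frame_features(signal, sample_rate, frame_ms=25.0, hop_ms=10.0, n_bands=12):
    """Return an (n_frames, n_bands) array of log band energies.

    Illustrative only: frame length, hop, and band count are assumed
    values, not parameters taken from the study.
    """
    signal = np.asarray(signal, dtype=float)
    frame_len = int(round(sample_rate * frame_ms / 1000.0))
    hop_len = int(round(sample_rate * hop_ms / 1000.0))
    if len(signal) < frame_len:
        raise ValueError("signal is shorter than one frame")

    n_frames = 1 + (len(signal) - frame_len) // hop_len
    window = np.hamming(frame_len)  # taper each frame to reduce spectral leakage
    feats = np.empty((n_frames, n_bands))

    for i in range(n_frames):
        start = i * hop_len
        frame = signal[start:start + frame_len] * window
        power = np.abs(np.fft.rfft(frame)) ** 2       # power spectrum of the frame
        bands = np.array_split(power, n_bands)        # equal-width frequency bands
        energies = np.array([b.sum() for b in bands])
        feats[i] = np.log(energies + 1e-12)           # small offset avoids log(0)
    return feats

if __name__ == "__main__":
    sr = 16000
    t = np.arange(sr) / sr
    tone = np.sin(2 * np.pi * 440.0 * t)  # 1 s synthetic tone standing in for speech
    print(frame_features(tone, sr).shape)
```

Each frame is described by the same small number of values regardless of how many samples it contains, which is the property the abstract attributes to feature extraction.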

In recent years, techniques for the digital processing of speech signals have been used as an auxiliary tool in the evaluation of vocal deviations, offering the patient greater comfort, lower cost, and more objectivity than traditionally employed techniques such as perceptual-auditory analysis. The evaluation of vocal quality through acoustic analysis of voice signals is becoming a very popular clinical practice for detecting vocal disorders, which in some cases are caused by laryngeal lesions or vocal abuse. In this research, we combined some traditional non-linear measures with recurrence quantification measures for the discriminative analysis of three vocal deviations: breathiness, roughness, and strain. The non-linear dynamic analysis characteristics used in the classification process were the Reconstruction Step (τ), the First Minimum of the Mutual Information Function (PM), and the Correlation Dimension (D2). The recurrence quantification measures employed were Determinism (Det), Shannon entropy (Entr), Mean length of diagonal lines (Lmed), Maximum length of vertical lines (Vmax), and Transitivity (Trans). Statistical tests were used to evaluate the potential of each characteristic to discriminate between the types of voice signals. In the classification process, an MLP (Multilayer Perceptron) neural network was used, trained with the supervised Scaled Conjugate Gradient (SCG) learning algorithm. An average accuracy of 90% was obtained in the discrimination between healthy and deviant voices. In the classification between healthy and strained voices, an average accuracy of 76% was obtained with the combined measures Trans, τ, Vmax, Lmed, Det, and D2. In the detection of the roughness deviation, an average accuracy of 89% was obtained with the Lmed, Entr, Trans, and D2 measures. In the distinction between healthy and breathy voices, an accuracy of 91.17% was obtained with only two combined measures, Trans and τ, showing the promise of the technique.
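To make three of the recurrence quantification measures named above more concrete, here is a minimal sketch that builds a recurrence matrix from a time-delay embedding and derives Det, Lmed, and Entr from its diagonal lines. It is not the authors' implementation. The embedding dimension, the threshold `eps`, the minimum line length `lmin`, and all function names are assumptions for illustration, and Vmax and Trans are not covered.

```python
import numpy as np

def embed(x, dim, tau):
    """Time-delay embedding: row i is (x[i], x[i+tau], ..., x[i+(dim-1)*tau])."""
    n = len(x) - (dim - 1) * tau
    if n <= 0:
        raise ValueError("series too short for this dim and tau")
    return np.column_stack([x[i * tau:i * tau + n] for i in range(dim)])

def recurrence_matrix(x, dim, tau, eps):
    """Boolean matrix: True where two embedded points are within eps."""
    pts = embed(np.asarray(x, dtype=float), dim, tau)
    dist = np.linalg.norm(pts[:, None, :] - pts[None, :, :], axis=2)
    return dist <= eps

def diagonal_line_lengths(rec):
    """Lengths of runs of True along each diagonal above the main diagonal."""
    lengths = []
    for k in range(1, rec.shape[0]):
        run = 0
        for v in np.diagonal(rec, offset=k):
            if v:
                run += 1
            elif run:
                lengths.append(run)
                run = 0
        if run:
            lengths.append(run)
    return np.array(lengths, dtype=int)

def rqa_measures(rec, lmin=2):
    """Det, Lmed, and Entr from diagonal lines of length >= lmin (assumed)."""
    lengths = diagonal_line_lengths(rec)
    lines = lengths[lengths >= lmin]
    if lengths.sum() == 0 or lines.size == 0:
        return {"Det": 0.0, "Lmed": 0.0, "Entr": 0.0}
    det = lines.sum() / lengths.sum()      # share of recurrence points on lines
    lmed = lines.mean()                    # mean diagonal line length
    _, counts = np.unique(lines, return_counts=True)
    p = counts / counts.sum()
    entr = float(-(p * np.log(p)).sum())   # Shannon entropy of line lengths
    return {"Det": float(det), "Lmed": float(lmed), "Entr": entr}

if __name__ == "__main__":
    n = np.arange(200)
    x = np.sin(2 * np.pi * n / 20)  # strictly periodic test signal, not voice data
    rec = recurrence_matrix(x, dim=2, tau=5, eps=0.1)
    print(rqa_measures(rec))
```

For a strictly periodic signal such as the sine above, every recurrence point lies on a long diagonal line, so Det is at its maximum; irregular signals would be expected to produce shorter, more fragmented lines.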

Articles published on Digital Processing Of Speech Signals

Improved Feature Parameter Extraction from Speech Signals Using Machine Learning Algorithm.

Speech Enhancement Using Deep Learning Methods: A Review

Method for Measuring the Indicator of Acoustic Quality of Audio Recordings Prepared for Registration and Processing in the Unified Biometric System

Linear and nonlinear versions of Phase Retrieval

Measurements method of the audio recordings acoustic quality indicator prepared for registration and processing in the Unified Biometric System

Exploiting nonlinearity of the speech production system for voice disorder assessment by recurrence quantification analysis.

A speech communication educational platform with high-capacity information hiding scheme

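Classificação de disfonias por meio da análise de medidas não lineares e de quantificação de recorrência [Classification of dysphonias through the analysis of nonlinear and recurrence quantification measures]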

Forensic Automatic Speaker Recognition [Exploratory DSP

Template-based and HMM-based Approaches for Isolated Spanish Digit Recognition

SpeechLab: PC software for digital speech signal processing

VLSI architecture for digital processing of speech signals

Digital speech signal processing for pitch change with jump control in accordance with pitch period

Digital processing of speech signals

Digital Processing of Speech Signals, by L. R. Rabiner and R. W. Schafer

Digital processing of speech signals

Book Review: Digital Processing of Speech Signals

Book reviews - Digital processing of speech signals

Digital Processing of Speech Signals
