Digital Speech Processing Research Articles

Wireless communication has become increasingly popular in recent years due to its mobility, portability and efficient service. Quality Voice Communication is the basic need of all such devices. Due to the significant spectrum demand, the bandwidth allocated to Professional Radio devices is decreasing. This results in lowering voice/ speech quality. Due to limited bandwidth, the design of Radio is changing. The main change is shifting from Analogue signal processing to Digital signal processing of speech and radio signals. The conversion of analogue-natured speech to digital form itself creates loss and faithfulness. Such Radio includes a voice coder for analogue to digital conversion of speech. Human spoken language, speaker (male or female), and pronunciation play an essential role in direct voice and Radio communication. It is the primary object which decides the design of low bandwidth coder. This paper deals with Marathi language characteristics and the interdependence of language constructs that affects voice/speech quality. In this paper, we extracted features of the Marathi language using PRAAT software and analyzed its constructs using statistical tools. The actual testing on Radio is carried out. The speech quality observed during such testing is correlated with statistical results. It is shown that the standard deviation in pitch and formants in male speech is less (average 35%) compared to the standard deviation in pitch and formants in female speech. This results in lowering the quality of male speech compared to female speech. It is also shown that some vowels and combinations of certain vowels and consonants produce poor speech quality compared to others.

Feature extraction is a critical stage of digital speech processing systems. Quality of features is of great importance to provide a solid foundation upon which the subsequent stages stand. Distinctive phonetic features (DPFs) are one of the most representative features of the speech signals. The significance of DPFs is in their ability to provide abstract description of the places and manners of articulation of the language phonemes. A phoneme's DPF element reflects unique articulatory information about that phoneme. Therefore, there is a need to discover and investigate each DPF element individually in order to achieve a deeper understanding and to come up with a descriptive model for each one. Such fine-grained modeling will satisfy the uniqueness of each DPF element. In this paper, the problem of DPF modeling and extraction of modern standard Arabic is tackled. Due to the remarkable success of deep neural networks (DNNs) that are initialized using deep belief networks (DBNs) in serving DSP applications and its capability of extracting highly representative features from the raw data, we exploit its modeling power to investigate and model the DPF elements. DNN models are compared with the classical multilayer perceptron (MLP) models. The representativeness of several acoustic cues for different DPF elements was also measured. This paper is based on formalizing DPF modeling problem as a binary classification problem. Because the DPF elements are highly imbalanced data, evaluating the quality of models is a very tricky process. This paper addresses the proper evaluation measures satisfying the imbalanced nature of the DPF elements. After modeling each element individually, the two top-level DPF extractors are designed: MLP- and DNN-based extractors. The results show the quality of DNN models and their superiority over MLPs with accuracies of 89.0% and 86.7%, respectively.

Digital Speech Processing Research Articles

Related Topics

Articles published on Digital Speech Processing

Modern Standard Arabic speech disorders corpus for digital speech processing applications

A measure of differences in speech signals by the voice timbre

English

Speech Coding Using Discrete Cosine Transform and Chaotic Map

MilVAD: A bag-level MNIST modelling of voice activity detection using deep multiple instance learning

A NOVEL SPEECH RECOGNITION SYSTEM USING FUZZY NEURAL NETWORK

Speaker Identification in Different Emotional States in Arabic and English

Fuzzy Logic Based Voice Recognition As Per Their Gender And Age Group

Fuzzy Logic Based Voice Recognition as per Their Gender and Age Group

Turkish vowel classification based on acoustical and decompositional features optimized by Genetic Algorithm

Distinctive Phonetic Features Modeling and Extraction Using Deep Neural Networks

Nova metoda za prepoznavanje aktivnosti ljudi zasnovana na IMU senzorima i na teoriji digitalne obrade govora

Perceptual Significance of Cepstral Distortion Measures in Digital Speech Processing

A Review on Feature Extraction Techniques for Speech Processing

Intelligent Integrated Home Security System Using Raspberry Pi

Speech Coding Techniques

USING DIGITAL PROCESSING OF SPEECH AND VIDEO TO SUPPORT INTERACTION BETWEEN THE DEAF COMMUNITY AND NORMAL PEOPLE

HIGH SPEED CARRY SAVE MULTIPLIER BASED LINEAR CONVOLUTION USING VEDIC MATHAMATICS

Wavelet adaptation for automatic voice disorders sorting

Review of distinctive phonetic features and the Arabic share in related modern research

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Digital Speech Processing Research Articles

Related Topics

Articles published on Digital Speech Processing

Modern Standard Arabic speech disorders corpus for digital speech processing applications

A measure of differences in speech signals by the voice timbre

English

Speech Coding Using Discrete Cosine Transform and Chaotic Map

MilVAD: A bag-level MNIST modelling of voice activity detection using deep multiple instance learning

A NOVEL SPEECH RECOGNITION SYSTEM USING FUZZY NEURAL NETWORK

Speaker Identification in Different Emotional States in Arabic and English

Fuzzy Logic Based Voice Recognition As Per Their Gender And Age Group

Fuzzy Logic Based Voice Recognition as per Their Gender and Age Group

Turkish vowel classification based on acoustical and decompositional features optimized by Genetic Algorithm

Distinctive Phonetic Features Modeling and Extraction Using Deep Neural Networks

Nova metoda za prepoznavanje aktivnosti ljudi zasnovana na IMU senzorima i na teoriji digitalne obrade govora

Perceptual Significance of Cepstral Distortion Measures in Digital Speech Processing

A Review on Feature Extraction Techniques for Speech Processing

Intelligent Integrated Home Security System Using Raspberry Pi

Speech Coding Techniques

USING DIGITAL PROCESSING OF SPEECH AND VIDEO TO SUPPORT INTERACTION BETWEEN THE DEAF COMMUNITY AND NORMAL PEOPLE

HIGH SPEED CARRY SAVE MULTIPLIER BASED LINEAR CONVOLUTION USING VEDIC MATHAMATICS

Wavelet adaptation for automatic voice disorders sorting

Review of distinctive phonetic features and the Arabic share in related modern research