Acoustic Feature Vectors Research Articles

Automatic detection of vowels plays a significant role in the analysis and synthesis of speech signals. Detecting vowels within a speech utterance in noisy environments and varied contexts is a challenging task. In this work, a robust technique based on non-local means (NLM) estimation is proposed for detecting vowels in noisy speech signals. In the NLM algorithm, the signal value at each sample point is estimated as the weighted sum of signal values at other sample points within a search neighborhood. Each weight is computed from the squared difference between the signal values belonging to two segments: one segment is kept fixed, while the other is slid over the search neighborhood. For any particular sample point, the sum of these weights is significantly lower when the segments under consideration are higher in magnitude. In a given speech signal, the vowels are regions of high energy, and this remains true even under noisy conditions. The sum of weight values (SWV), computed at each time instant, is therefore used as a discriminating feature for detecting vowels. In the proposed approach, regions where the SWV exhibits significant transitions and attains lower values for a considerable duration compared to the preceding and succeeding regions are hypothesized to be vowels. This hypothesis is statistically validated for detecting vowels under clean as well as noisy test conditions. For comparison, a three-class statistical classifier (vowel, non-vowel and silence) is developed, using mel-frequency cepstral coefficients as the acoustic feature vectors and a deep neural network (DNN)-hidden Markov model (HMM) system for acoustic modeling. The proposed vowel detection method is observed to outperform the DNN-HMM-based statistical classifiers as well as existing signal processing approaches under both clean and noisy test conditions.
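The SWV feature described in this abstract can be illustrated with a minimal sketch. The abstract does not give the exact weight formula, so the code below assumes the Gaussian kernel of the squared patch difference that is common in NLM denoising. The function name `sum_of_weight_values`, the patch and search sizes, and the bandwidth `h` are all hypothetical choices for illustration, not values from the paper.

```python
import numpy as np

def sum_of_weight_values(x, half_patch=5, half_search=20, h=0.1):
    """Sum of NLM weights at every sample of a 1-D signal.

    Assumption: each weight is exp(-d2 / (2 * patch_len * h**2)), where d2 is
    the squared difference between a fixed patch and a slid patch. The
    abstract only says the weight is based on the squared difference.
    """
    x = np.asarray(x, dtype=float)
    pad = half_patch + half_search
    xp = np.pad(x, pad, mode="reflect")      # avoid running off the ends
    patch_len = 2 * half_patch + 1
    swv = np.zeros(len(x))
    for i in range(len(x)):
        c = i + pad                           # index of sample i in padded signal
        fixed = xp[c - half_patch : c + half_patch + 1]
        total = 0.0
        for s in range(-half_search, half_search + 1):
            if s == 0:
                continue                      # skip comparing the patch with itself
            slid = xp[c + s - half_patch : c + s + half_patch + 1]
            d2 = np.sum((fixed - slid) ** 2)
            total += np.exp(-d2 / (2 * patch_len * h ** 2))
        swv[i] = total
    return swv

# Toy signal: a low-amplitude stretch followed by a high-amplitude stretch.
t = np.arange(400)
x = np.where(t < 200, 0.01, 1.0) * np.sin(2 * np.pi * t / 16)
swv = sum_of_weight_values(x)
print(swv[50:150].mean(), swv[250:350].mean())
```

On this toy signal the SWV is lower in the high-amplitude half, which mirrors the behaviour the abstract relies on. Turning the SWV into vowel decisions (detecting transitions and sustained low regions) is not covered by this sketch.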

The flow of particulate solid materials in a gas flowline can significantly erode mechanical equipment. Hence, the oil and gas industry needs real-time quantitative monitoring to achieve real-time control and production optimisation. Although a considerable amount of research has employed acoustic signals for qualitative monitoring, there is still an unmet demand for a simple and robust real-time quantitative monitoring system. Acoustic signal processing combined with machine learning has the potential to meet this demand, but it has not previously been exploited for real-time quantitative monitoring of particulate solid materials in a gas flowline. This paper proposes a novel instrumentation system for on-line measurement of solid flow rate, solid concentration, line pressure drop and gas velocity in a gas-solid multiphase flow, using acoustic sensing technology coupled with signal processing techniques and a machine learning algorithm. The acoustic sensor captures the acoustic wave emitted when solid particles impinge on the bend component of the flowline. Signal processing techniques are used to extract relevant features of the impingements. An integrated, conventional Artificial Neural Network (ANN) is used to capture the distribution of the acoustic feature vectors in order to establish the relationship between the measurands and the acoustic signal. However, conventional ANNs are mainly concerned with capturing systematic patterns in a distribution of measurements fixed in time, whereas here the dynamics of the generated acoustic signal vary with time. A modification called the Time-Delay Neural Network (TDNN) is used to capture these dynamics, and the performance of the classical ANN and the TDNN models is compared. With the classical ANN, the normalised root mean square error (NRMSE) is 0.66, 0.29, 0.26 and 0.46 for the solid flow rate, solid concentration, line pressure drop and gas velocity respectively. With the TDNN model, the NRMSE is 0.18, 0.17, 0.20 and 0.16 for the same four measurands. The TDNN model therefore performs better than the ANN model, with a lower NRMSE for every measurand. Overall, this study lays the basis for employing signal processing techniques and machine learning algorithms in the development of a simple, reliable and low-cost real-time quantitative particulate solid flow monitoring system.
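Two ideas from this abstract lend themselves to a short sketch: feeding a network with time-delayed copies of the feature vectors, and scoring predictions with NRMSE. The abstract specifies neither the TDNN architecture nor how the RMSE is normalised, so the code below makes two assumptions: the delayed input is built as a simple tapped delay line, and the RMSE is divided by the range of the true values. The names `delay_embed` and `nrmse` are hypothetical.

```python
import numpy as np

def delay_embed(features, n_delays):
    """Stack each frame with its n_delays predecessors (tapped delay line).

    features: array of shape (n_frames, n_features) or (n_frames,).
    Returns an array of shape (n_frames - n_delays, (n_delays + 1) * n_features)
    whose rows are [frame t, frame t-1, ..., frame t-n_delays].
    """
    features = np.asarray(features, dtype=float)
    if features.ndim == 1:
        features = features[:, None]
    n_frames = features.shape[0]
    if n_delays >= n_frames:
        raise ValueError("n_delays must be smaller than the number of frames")
    cols = [features[n_delays - k : n_frames - k] for k in range(n_delays + 1)]
    return np.hstack(cols)

def nrmse(y_true, y_pred):
    """RMSE divided by the range of y_true (assumed normalisation)."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    rmse = np.sqrt(np.mean((y_true - y_pred) ** 2))
    return rmse / (y_true.max() - y_true.min())

# Each row holds the current frame followed by its two predecessors.
print(delay_embed(np.arange(5), n_delays=2))

# A constant error of 1 over a range of 4 gives an NRMSE of 0.25.
y = np.array([0.0, 1.0, 2.0, 3.0, 4.0])
print(nrmse(y, y + 1.0))
```

A static ANN would see one row of the original features per frame, while a delay-embedded input exposes a short history to the same kind of network. Other NRMSE conventions (dividing by the mean or the standard deviation) give different numbers, so the values quoted in the abstract should not be compared against this definition without checking the paper.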

Articles published on Acoustic Feature Vectors

Ensemble Feature Selection for Age Estimation from Speech

Confusion and Countermeasures of College Students’ Career Guidance Work Based on Deep Learning Models

A short utterance speaker recognition method with improved cepstrum–CNN

Improving Depression Prediction Accuracy Using Fisher Score-Based Feature Selection and Dynamic Ensemble Selection Approach Based on Acoustic Features of Speech

A study of using cough sounds and deep neural networks for the early detection of Covid-19

A Novel Human-Vehicle Interaction Assistive Device for Arab Drivers Using Speech Recognition

Multi-Modal Emotion Recognition Using Speech Features and Text-Embedding

Mobile microphone robust acoustic feature identification using coefficient of variance

L2 Mispronunciation Verification Based on Acoustic Phone Embedding and Siamese Networks

Hilbert–Huang–Hurst‐based non‐linear acoustic feature vector for emotion classification with stochastic models and learning systems

Speeding up training of automated bird recognizers by data reduction of audio features

Standard Yorùbá context dependent tone identification using Multi-Class Support Vector Machine (MSVM)

An Adaptive Method for Robust Detection of Vowels in Noisy Environment

Speech spoofing countermeasures based on source voice analysis and machine learning techniques

A comparative study of deep neural network based Punjabi-ASR system

Continuous Tamil Speech Recognition technique under non stationary noisy environments

Acoustic signal processing with robust machine learning algorithm for improved monitoring of particulate solid materials in a gas flowline

Recent progress in deep end-to-end models for spoken language processing

A flexible discriminative approach to automatic phone and broad phonetic group classification

Bayesian hindcast of acoustic transmission loss in the western Pacific Ocean
