Speech Feature Parameters Research Articles

Seismic facies analysis plays an important role in seismic stratigraphy. Seismic attributes have been widely applied to seismic facies analysis. One of the most important steps is to optimize the most sensitive attributes with regard to reservoir characteristics. Using different attribute combinations in multidimensional analyses will yield different solutions. Acoustic waves and seismic waves propagating in an elastic medium follow the same law of physics. The generation process of a speech signal based on the acoustic model is similar to the seismic data of the convolution model. We have developed the mel-frequency cepstrum coefficients (MFCCs), which have been successfully applied in speech recognition, as feature parameters for seismic facies analysis. Information about the wavelet and reflection coefficients is well-separated in these cepstrum-domain parameters. Specifically, information about the wavelet mainly appears in the low-domain part, and information about the reflection coefficients mainly appeared in the high-domain part. In the forward model, the seismic MFCCs are used as feature vectors for synthetic data with a noise level of zero and 5%. The Bayesian network is used to classify the traces. Then, classification accuracy rates versus different orders of the MFCCs are obtained. The forwarding results indicate that high accuracy rates are achieved when the order exceeds 10. For the real field data, the seismic data are decomposed into a set of MFCC parameters. The different information is unfolded in the parameter maps, enabling the interpreter to capture the geologic features of the target interval. The geologic features presented in the three instantaneous attributes and coherence can also be found in the MFCC parameter maps. The classification results are in accordance with the paleogeomorphy of the target interval as well as the known wells. The results from the synthetic data and real field data demonstrate the information description abilities of the seismic MFCC parameters. Therefore, using the speech feature parameters to extract information may be helpful for processing and interpreting seismic data.

Read full abstract

This paper proposes a frontend processing technique that employs a speech feature extraction method called Subband based Periodicity and Aperiodicity DEcomposition (SPADE), and examines its validity for automatic speech recognition in noisy environments. SPADE divides speech signals into subband signals, which are then decomposed into their periodic and aperiodic features, and uses both features as speech feature parameters. SPADE employs independent periodicity estimation within each subband and periodicity–aperiodicity decomposition design based on a parallel distributed processing technique motivated by the human speech perception process. Unlike other speech features, this decomposition of speech into two characteristics provides information about periodicities and aperiodicities, and thus allows the utilization of the robustness exhibited by periodic features without losing certain essential information included in aperiodic features. This paper first introduces an implementation of SPADE that operates in the frequency domain, and then examines the validity of combining SPADE with speech enhancement methods. For this examination, we combine SPADE with noise compensation methods that operate in the frequency domain and cepstral normalization methods. In addition, we employ an energy parameter calculation method based on the SPADE framework. An evaluation with the AURORA-2J noisy continuous digit speech recognition database (Japanese AURORA-2) shows that SPADE combined with adaptive Wiener filtering, cepstral normalization, and the energy parameter achieves average word accuracy rates of 82.58% with clean training and 92.55% with multicondition training. These rates are higher than those achieved with ETSI WI008 advanced DSR frontend processing (77.98% and 91.01%, respectively) whose speech feature parameter is based on conventional Mel-frequency cepstral coefficients. By comparison with ETSI WI008 advanced DSR frontend, the proposed method reduces word error rates by 20.9% with clean training and 17.2% with multicondition training. These results confirmed that SPADE combined with noise reduction methods can increase robustness in the presence of noise.

Read full abstract

Speech Feature Parameters Research Articles

Related Topics

Articles published on Speech Feature Parameters

A Study on Speech Recognition by a Neural Network Based on English Speech Feature Parameters

A Recognition Method Based on Speech Feature Parameters-English Teaching Practice

Whispered Speech Conversion Based on the Inversion of Mel Frequency Cepstral Coefficient Features

The Emotion Recognition System Based on Support Vector Machines

A Probe into Spoken English Recognition in English Education Based on Computer-Aided Comprehensive Analysis

The Key Technology of Speech Interaction Based on Deep Learning

Constructing accurate and robust HMM/GMM models for an Arabic speech recognition system

Seismic facies analysis based on speech recognition feature parameters

Reverberant speech recognition combining deep neural networks and deep autoencoders augmented with a phone-class feature

Speech Feature Parameter Extraction and Recognition Based on Interpolation

A Method of Feature Selection of Voice Content Classification Based on Analysis of Variance in Orthogonal Experiments

English Sentence Recognition Based on HMM and Clustering

Speech Recognition Approach Based on Speech Feature Clustering and HMM

Development of Application Specific Continuous Speech Recognition System in Hindi

Application Research of HHT-IF Speech Feature Parameter in Speaker Recognition System

Speaker recognition techniques for remote authentication of users in computer networks

Speech information processing method and apparatus and storage medium using a segment pitch pattern model

A feature extraction method using subband based periodicity and aperiodicity decomposition with noise robust frontend processing for automatic speech recognition

Teager energy based feature parameters for speech recognition in car noise

Speech recognition apparatus using syntactic and semantic analysis

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Speech Feature Parameters Research Articles

Related Topics

Articles published on Speech Feature Parameters

A Study on Speech Recognition by a Neural Network Based on English Speech Feature Parameters

A Recognition Method Based on Speech Feature Parameters-English Teaching Practice

Whispered Speech Conversion Based on the Inversion of Mel Frequency Cepstral Coefficient Features

The Emotion Recognition System Based on Support Vector Machines

A Probe into Spoken English Recognition in English Education Based on Computer-Aided Comprehensive Analysis

The Key Technology of Speech Interaction Based on Deep Learning

Constructing accurate and robust HMM/GMM models for an Arabic speech recognition system

Seismic facies analysis based on speech recognition feature parameters

Reverberant speech recognition combining deep neural networks and deep autoencoders augmented with a phone-class feature

Speech Feature Parameter Extraction and Recognition Based on Interpolation

A Method of Feature Selection of Voice Content Classification Based on Analysis of Variance in Orthogonal Experiments

English Sentence Recognition Based on HMM and Clustering

Speech Recognition Approach Based on Speech Feature Clustering and HMM

Development of Application Specific Continuous Speech Recognition System in Hindi

Application Research of HHT-IF Speech Feature Parameter in Speaker Recognition System

Speaker recognition techniques for remote authentication of users in computer networks

Speech information processing method and apparatus and storage medium using a segment pitch pattern model

A feature extraction method using subband based periodicity and aperiodicity decomposition with noise robust frontend processing for automatic speech recognition

Teager energy based feature parameters for speech recognition in car noise

Speech recognition apparatus using syntactic and semantic analysis