Acoustic surveys play a pivotal role in fisheries management. During these surveys, acoustic signals are sent into the water and the strength of the reflection, known as backscatter, is recorded. The collected data are typically annotated manually to support acoustic target classification (ATC), a process that is both labor-intensive and time-consuming. The primary objective of this study is to develop an annotation-free deep learning model that extracts acoustic features and improves the representation of acoustic data. For this purpose, we adopt a self-supervised method inspired by the Self DIstillation with NO Labels (DINO) model. Extracting useful acoustic features is an intricate task due to the inherent variability and complexity of biological targets, as well as the environmental and technical factors influencing sound interactions. The proposed model is trained with three sampling methods: random sampling, which ignores the class imbalance present in the acoustic survey data; class-balanced sampling, which ensures equal representation of known categories; and intensity-based sampling, which selects data to capture backscatter variations. The quality of the extracted features is then evaluated and compared. We show that, compared with using the untreated data, the extracted features improve the discriminative power of several machine learning methods (k-nearest neighbors (kNN), linear regression, and multinomial logistic regression) for ATC. The improvement is reflected in higher kNN accuracy (77.55% vs. 71.93%), macro AUC in logistic regression (0.92 vs. 0.80), and R² in linear regression (0.69 vs. 0.45). Our findings highlight the advantage of applying emerging self-supervised techniques in fisheries acoustics. This study thus contributes to the ongoing efforts to improve the efficiency of acoustic surveys in fisheries management.
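The class-balanced and intensity-based sampling strategies described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names, the use of mean backscatter as a sampling weight, and the with-replacement fallback for rare classes are all assumptions made for the example.

```python
import numpy as np

def class_balanced_indices(labels, n_per_class, rng=None):
    """Draw an equal number of sample indices from each known class,
    regardless of how frequent the class is in the survey data."""
    rng = np.random.default_rng(rng)
    selected = []
    for c in np.unique(labels):
        pool = np.flatnonzero(labels == c)
        # Sample with replacement when a class has fewer examples
        # than requested, so rare classes are not under-drawn.
        selected.append(
            rng.choice(pool, size=n_per_class,
                       replace=len(pool) < n_per_class)
        )
    return np.concatenate(selected)

def intensity_weighted_indices(backscatter, n, rng=None):
    """Draw indices with probability proportional to backscatter
    intensity, so the sample captures strong-echo variations."""
    rng = np.random.default_rng(rng)
    weights = np.asarray(backscatter, dtype=float)
    weights = weights - weights.min() + 1e-9  # shift to strictly positive
    p = weights / weights.sum()
    return rng.choice(len(weights), size=n, replace=False, p=p)
```

Random sampling would simply be a uniform `rng.choice` over all indices; the two helpers above differ only in how they reweight that draw.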