Synthesis Filter Research Articles

Autoregressive models for the envelope of speech power spectral densities (PSDs) are refined by the self-supervised spectral learning machine (S3LM) provided with differentiable spectral objective functions, including the Itakura-Saito divergence (ISD), the Kullback-Leibler divergence (KLD), the reverse KLD (RKLD) and the log spectral distortion (LSD), which display more significant results. However, in order to assess the models more perceptually, a method is proposed based upon perturbations around perfect reconstruction analysis-synthesis configurations. In the cross-excitation analysis-synthesis assessment (CEASA) method, the residual signals generated by analysis filters of the spectral models are injected as excitation into the synthesis filters derived from the same and other models in order to be evaluated by the perceptual evaluation of speech quality (PESQ) and Itakura divergence (ID), which are averaged over a set of models obtained using the objective functions mentioned above. The results lead to a superior performance when the RKLD is used as the loss function for the estimation of the spectral models with the ISD ranking close behind. The focus of these divergences on the spectral peaks is argued and pointed as the most important factor for this behavior. Specifically, using the PESQ scores obtained with CEASA, the RKLD loss is found to improve the performance by 1.0%, 4.0% and 19.3% with respect to the open-loop analysis, the KLD and the LSD models, respectively, while the corresponding improvements for the ISD loss are 0.1%, 3.0% and 18.2%, and the RKLD models excel the ISD models by 1.0% on average. Even though the spectral measures alone are not able to unequivocally distinguish the better of the two, CEASA is shown to have enough sensitivity to distinguish their performances. In summary, the learning machine S3LM fits models for the short-term spectral envelope of speech and, for the evaluation of its performance under several differentiable loss functions, the CEASA assessment tool has been developed. In addition, CEASA may be used for other assessments connected with speech analysis and synthesis.

Speech enhancement is one of the most important fields in audio and speech signal processing. The speech enhancement methods are divided into the single and multi-channel algorithms. The multi-channel methods increase the speech enhancement performance by providing more information with the use of more microphones. In addition, spatial aliasing is one of the destructive factors in speech enhancement strategies. In this article, we first propose a uniform circular nested microphone array (CNMA) for data recording. The microphone array increases the accuracy of the speech processing methods by increasing the information. Moreover, the proposed nested structure eliminates the spatial aliasing between microphone signals. The circular shape in the proposed nested microphone array implements the speech enhancement algorithm with the same probability for the speakers in all directions. In addition, the speech signal information is different in frequency bands, where the sub-band processing is proposed by the use of the analysis filter bank. The frequency resolution is increased in low frequency components by implementing the proposed filter bank. Then, the affine projection algorithm (APA) is implemented as an adaptive filter on sub-bands that were obtained by the proposed nested microphone array and analysis filter bank. This algorithm adaptively enhances the noisy speech signal. Next, the synthesis filters are implemented for reconstructing the enhanced speech signal. The proposed circular nested microphone array in combination with the sub-band affine projection algorithm (CNMA-SBAPA) is compared with the least mean square (LMS), recursive least square (RLS), traditional APA, distributed multichannel Wiener filter (DB-MWF), and multichannel nonnegative matrix factorization-minimum variance distortionless response (MNMF-MVDR) in terms of the segmental signal-to-noise ratio (SegSNR), perceptual evaluation of speech quality (PESQ), mean opinion score (MOS), short-time objective intelligibility (STOI), and speed of convergence on real and simulated data for white and colored noises. In all scenarios, the proposed method has high accuracy at different levels and noise types by the lower distortion in comparison with other works and, furthermore, the speed of convergence is higher than the compared researches.

Synthesis Filter Research Articles

Related Topics

Articles published on Synthesis Filter

Demonstration of a terahertz integrated planar network synthesis filter

Reconstruction of a signal from multirate observations: A recursive approach

Pseudo-Bayesian Approach for Robust Mode Detection and Extraction Based on the STFT

Channel estimation based on interference cancellation method for FBMC-PON

A New Approach to the Design and Implementation of a Family of Multiplier Free Orthogonal Wavelet Filter Banks

Differentiable Measures for Speech Spectral Modeling

Research of application of DFT-modulated filter bank in systems with significant spectral component amplification

Artificial bandwidth extension using [formula omitted] sampled-data control theory

NANO-studio, the design environment of filter banks implemented in standard CMOS technology

Spectral Domain Spline Graph Filter Bank

High‐band feature extraction for artificial bandwidth extension using deep neural network andH∞optimisation

Особенности очистки отработавших газов судовых энергетических установок в пористых проницаемых каталитических материалах

Multiresolution Speech Enhancement Based on Proposed Circular Nested Microphone Array in Combination with Sub-Band Affine Projection Algorithm

Improvement of LMS adaptive noise canceller using uniform Poly-phase digital filter bank

Design of Transmultiplexer Filter Banks Using Ramanujan Sums

On the Design of Two-Dimensional Quincunx Filterbanks with Directional Vanishing Moment Based on Eigenfilter Approach

Oversampled DFT-Modulated Biorthogonal Filter Banks: Perfect Reconstruction Designs and Multiplierless Approximations

MWA tied-array processing III: Microsecond time resolution via a polyphase synthesis filter

Nonsubsampled Graph Filter Banks: Theory and Distributed Algorithms

Redefined Block-Lifting-Based Filter Banks With Efficient Reversible Nonexpansive Convolution

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Synthesis Filter Research Articles

Related Topics

Articles published on Synthesis Filter

Demonstration of a terahertz integrated planar network synthesis filter

Reconstruction of a signal from multirate observations: A recursive approach

Pseudo-Bayesian Approach for Robust Mode Detection and Extraction Based on the STFT

Channel estimation based on interference cancellation method for FBMC-PON

A New Approach to the Design and Implementation of a Family of Multiplier Free Orthogonal Wavelet Filter Banks

Differentiable Measures for Speech Spectral Modeling

Research of application of DFT-modulated filter bank in systems with significant spectral component amplification

Artificial bandwidth extension using [formula omitted] sampled-data control theory

NANO-studio, the design environment of filter banks implemented in standard CMOS technology

Spectral Domain Spline Graph Filter Bank

High‐band feature extraction for artificial bandwidth extension using deep neural network andH∞optimisation

Особенности очистки отработавших газов судовых энергетических установок в пористых проницаемых каталитических материалах

Multiresolution Speech Enhancement Based on Proposed Circular Nested Microphone Array in Combination with Sub-Band Affine Projection Algorithm

Improvement of LMS adaptive noise canceller using uniform Poly-phase digital filter bank

Design of Transmultiplexer Filter Banks Using Ramanujan Sums

On the Design of Two-Dimensional Quincunx Filterbanks with Directional Vanishing Moment Based on Eigenfilter Approach

Oversampled DFT-Modulated Biorthogonal Filter Banks: Perfect Reconstruction Designs and Multiplierless Approximations

MWA tied-array processing III: Microsecond time resolution via a polyphase synthesis filter

Nonsubsampled Graph Filter Banks: Theory and Distributed Algorithms

Redefined Block-Lifting-Based Filter Banks With Efficient Reversible Nonexpansive Convolution