Noisy Speech Signal Research Articles

Purpose: The purpose of this work is to design robust estimators for speech enhancement that incorporate rank-order statistics and locally adaptive neighborhoods. The proposed estimators improve the speech quality of a noisy signal, better preserve speech intelligibility, and introduce fewer artifacts than known speech enhancement estimators.

Design/methodology/approach: We design a novel speech enhancement algorithm based on rank-order statistics and locally adaptive signal processing to improve on existing speech enhancement estimators in terms of speech quality, intelligibility, and introduced artifacts.

Findings: The proposed estimators adapt better to the nonstationary characteristics of speech and noise processes than known speech enhancement estimators. The proposed algorithm increases speech quality, better preserves speech intelligibility, and introduces fewer artifacts than those estimators.

Research limitations/implications: The proposed approach is locally adaptive signal processing performed for each element of a noisy speech signal. Its main limitation is therefore higher computational complexity than that of conventional nonadaptive techniques.

Practical implications: To perform real-time speech enhancement with the proposed approach, a digital system with a fast processor is recommended. Another option is a parallel architecture such as an FPGA.

Originality/value: We propose a novel locally adaptive algorithm for robust speech enhancement that incorporates rank-order statistics and locally adaptive neighborhoods. The algorithm can adjust itself in response to changes in the statistical properties of ambient noise.
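The abstract does not spell out the estimators themselves, so the following is only a minimal sketch of the general idea of combining a rank-order statistic with a locally adaptive neighborhood. It uses a modified-trimmed-mean style filter as a stand-in, not the authors' method: for each sample, the median of a sliding window is computed, and only the window samples within `eps` of that median are averaged. The function name `mtm_filter` and the parameters `half_window` and `eps` are hypothetical, and the filter is applied to a plain list of numbers rather than to real speech.

```python
from statistics import median_low


def mtm_filter(samples, half_window=2, eps=0.5):
    """Illustrative filter: average the window samples that lie close to the window median.

    Assumption: this is a generic modified-trimmed-mean style filter used as a
    stand-in; it is not the estimator proposed in the article.
    """
    n = len(samples)
    out = []
    for i in range(n):
        # Sliding window, truncated at the signal boundaries.
        window = samples[max(0, i - half_window): min(n, i + half_window + 1)]
        # Rank-order statistic: median_low always returns an actual window sample,
        # so the neighborhood built below is never empty.
        m = median_low(window)
        # Locally adaptive neighborhood: keep only samples within eps of the median.
        neighborhood = [v for v in window if abs(v - m) <= eps]
        out.append(sum(neighborhood) / len(neighborhood))
    return out


# Hypothetical data: a flat signal with one impulsive outlier at index 3.
noisy = [1.0, 1.1, 0.9, 9.0, 1.0, 1.2, 1.0]
smoothed = mtm_filter(noisy)
```

Because the neighborhood is rebuilt for every sample, the cost grows with both signal length and window size, which matches the computational limitation the abstract mentions.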


A vowel onset point (VOP) is the instant at which the vowel region starts in a speech signal. VOPs play a vital role in speech processing applications such as syllable detection, speaker verification, duration modification, and language identification. Several algorithms exist for detecting VOPs in a speech signal. An algorithm that combines evidence extracted from the source excitation, spectral peaks, and modulation spectrum is used as the baseline system for the present work. The baseline system performs well on clean speech, but its performance degrades under noisy conditions, where it detects a larger number of spurious VOPs. According to the available literature, this degradation is due to the spectral broadening of speech in noisy environments. In this paper we propose a pre-processing technique, applied ahead of the baseline system, to reduce this spectral broadening effect of noise. The noisy speech is first passed through the pre-processing algorithm and then through the baseline system to detect the VOPs. Experiments were carried out on clean and various noisy speech signals. The results show an improvement of 16–21% in the removal of spurious VOPs over the existing baseline system under different noisy speech conditions. Further, the proposed method was compared with two of the best-performing VOP detection techniques and was found to give superior identification accuracy and identification rate.
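To make the "spurious VOP" measure concrete, here is a small sketch of how such detections might be counted and how a relative reduction could be computed. The abstract does not give the evaluation protocol, so everything here is an assumption: a detection is treated as spurious when no reference VOP lies within a tolerance (40 ms is used purely as an illustrative value), and the function names and all timestamps are hypothetical.

```python
def count_spurious(detected, reference, tolerance=0.04):
    """Count detected VOPs (seconds) with no reference VOP within +/- tolerance.

    Assumption: the tolerance-based matching rule is illustrative, not taken
    from the article.
    """
    return sum(
        1 for d in detected
        if all(abs(d - r) > tolerance for r in reference)
    )


def relative_reduction(before, after):
    """Percentage reduction in spurious detections relative to the baseline count."""
    return 100.0 * (before - after) / before


# Hypothetical timestamps in seconds; these are not results from the article.
reference = [0.50, 1.20, 2.00]
baseline_detected = [0.51, 0.80, 1.22, 1.60, 1.99, 2.40]
preprocessed_detected = [0.51, 1.22, 1.60, 1.99]

spurious_before = count_spurious(baseline_detected, reference)
spurious_after = count_spurious(preprocessed_detected, reference)
reduction = relative_reduction(spurious_before, spurious_after)
```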



Articles published on Noisy Speech Signal

Spectral Reconstruction and Noise Model Estimation Based on a Masking Model for Noise Robust Speech Recognition

Research on Speech Endpoint Detection Algorithm with Low SNR

A Method for Voiced/Unvoiced Classification of Noisy Speech by Analyzing Time-Domain Features of Spectrogram Image

Speech enhancement based on neural networks improves speech intelligibility in noise for cochlear implant users

Performance Analysis of Adaptive Algorithms for Speech Enhancement Applications

Enhanced multiclass SVM with thresholding fusion for speech-based emotion classification

Speech Enhancement using Kalman Filter with Preprocessed Digital Expander in Noisy Environment

Speech Enhancement Using Iterative Kalman Filter with Time and Frequency Mask in Different Noisy Environment

Mean-Median based Noise Estimation Method using Spectral Subtraction for Speech Enhancement Technique

Sparse Representations for Single Channel Speech Enhancement Based on Voiced/Unvoiced Classification

A wavelet-based transform method for quality improvement in noisy speech patterns of Arabic language

Estimating acoustic speech features in low signal-to-noise ratios using a statistical framework

EEG-Informed Attended Speaker Extraction From Recorded Speech Mixtures With Application in Neuro-Steered Hearing Prostheses

AN EFFICIENT MODIFIED CUCKOO SEARCH BASED NOISE DESTRUCTION AND IMPROVEMENT OF SPEECH SIGNAL QUALITY

Speech enhancement using robust estimators and rank-order statistics

Speech Enhancement Using Multi‐channel Post‐Filtering with Modified Signal Presence Probability in Reverberant Environment

Speech Enhancement based on Wiener Filter and Compressive Sensing

A pre-processing method for improvement of vowel onset point detection under noisy conditions

An efficient frequency-domain adaptive forward BSS algorithm for acoustic noise reduction and speech quality enhancement

Phase distortion resulting in a just noticeable difference in the perceived quality of speech

