Modified Bark Spectral Distortion Research Articles

The paper presents a speech generative model that provides an efficient way of generating speech waveform from its amplitude spectral envelopes. The model is based on hybrid speech representation that includes deterministic (harmonic) and stochastic (noise) components. The main idea behind the approach originates from the fact that speech signal has a determined spectral structure that is statistically bound with deterministic/stochastic energy distribution in the spectrum. The performance of the model is evaluated using an experimental low-bitrate wide-band speech coder. The quality of reconstructed speech is evaluated using objective and subjective methods. Two objective quality characteristics were calculated: Modified Bark Spectral Distortion (MBSD) and Perceptual Evaluation of Speech Quality (PESQ). Narrow-band and wide-band versions of the proposed solution were compared with MELP (Mixed Excitation Linear Prediction) speech coder and AMR (Adaptive Multi-Rate) speech coder, respectively. The speech base of two female and two male speakers were used for testing. The performed tests show that overall performance of the proposed approach is speaker-dependent and it is better for male voices. Supposedly, this difference indicates the influence of pitch highness on separation accuracy. In that way, using the proposed approach in experimental speech compression system provides decent MBSD values and comparable PESQ values with AMR speech coder at 6,6 kbit/s. Additional subjective listening testsdemonstrate that the implemented coding system retains phonetic content and speaker’s identity. It proves consistency of the proposed approach.

Read full abstract

The increasing importance of multimedia applications is placing a great insist on content protection and customer privacy. Communications can be intercepted, especially over wireless links. Since encryption can effectively prevent eavesdropping, its use is widely advocated. The codec G. 729 based CS-ACELP algorithm is standardized as voice codec by ITU-T for multimedia and Voice over Internet Protocol (VoIP) applications. In this paper, we introduce a speech encryption method based chaotic cat map algorithm. Cat map extended to two-dimensional NxN matrix. It takes concepts from linear algebra and uses them to change the positions of the values of the matrix. The result after applying the Cat Map will be shuffled signals that contain the same values of the original signals. We applied our encryption scheme to the standard ITU-T G.729 standard speech coder to evaluate its performance. Simulation results show that G.729 based cat map encryption is very efficient since the encrypted speech is similar to a white noise. The perceptual evaluation of speech quality (PESQ) and enhanced modified bark spectral distortion (EMBSD) tests for speech extracted from TIMIT database confirm the efficiency of our proposed scheme.

Read full abstract

Modified Bark Spectral Distortion Research Articles

Related Topics

Articles published on Modified Bark Spectral Distortion

AN EFFICIENT SPEECH GENERATIVE MODEL BASED ON DETERMINISTIC/STOCHASTIC SEPARATION OF SPECTRAL ENVELOPES

A Human Auditory Perception Loss Function Using Modified Bark Spectral Distortion for Speech Enhancement

Efficient speech encryption using chaotic cat map for code-excited linear prediction based coders in packet networks

Intraframe quantization of speech line spectrum pairs for code-excited linear prediction based coders in packet networks

Psychoacoustically Constrained and Distortion Minimized Speech Enhancement

Perceptual improvement of Wiener filtering employing a post-filter

Adaptive noise spectral estimation for spectral subtraction speech enhancement

Audible Noise Reduction in Eigendomain for Speech Enhancement

Speech enhancement using constrained spectral amplitude subtraction based on noncausal a priori SNR

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Modified Bark Spectral Distortion Research Articles

Related Topics

Articles published on Modified Bark Spectral Distortion

AN EFFICIENT SPEECH GENERATIVE MODEL BASED ON DETERMINISTIC/STOCHASTIC SEPARATION OF SPECTRAL ENVELOPES

A Human Auditory Perception Loss Function Using Modified Bark Spectral Distortion for Speech Enhancement

Efficient speech encryption using chaotic cat map for code-excited linear prediction based coders in packet networks

Intraframe quantization of speech line spectrum pairs for code-excited linear prediction based coders in packet networks

Psychoacoustically Constrained and Distortion Minimized Speech Enhancement

Perceptual improvement of Wiener filtering employing a post-filter

Adaptive noise spectral estimation for spectral subtraction speech enhancement

Audible Noise Reduction in Eigendomain for Speech Enhancement

Speech enhancement using constrained spectral amplitude subtraction based on noncausal a priori SNR