Frequency bark cepstral coefficients extraction for speech analysis by synthesis.

Abel Herrera,Fernando Del Rio

doi:10.1121/1.3508042

Abstract

The Mel scale was proposed in 1937, following a series of experiments used to establish a perceptual scale. The use of the Mel scale is almost standard for speech recognition application. The Bark scale divides the audible spectrum into 24 critical bands that try to mimic the frequency response of the human ear. This article describes the process used to extract a set of cepstral coefficients from a warped frequency space (Mel and Bark) and analyze the perceived differences in the reconstructed signal. We will try to determine if there is any audible improvement between these two most warping functions for the purpose of speech analysis by synthesis. We will use the same procedure for parameter extraction and signal reconstruction for both functions, replacing only the warping scale used, to minimize the distortion other elements might add to the results. After running the waveform through both processes and reconstructing a wave signal from the parameters, while the resulting output was somewhat different, there were slight differences between the bark and mel generated signals. Statistical tests are now running between Mel and Bark scales.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Frequency bark cepstral coefficients extraction for speech analysis by synthesis.

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America

Lead the way for us

Journal: The Journal of the Acoustical Society of America	Publication Date: Oct 1, 2010
Citations: 5

Similar Papers

Speech Compression Based on Frequency Warped Cepstrum and Wavelet Analysis
Francisco J Ayala ... Abel Herrera
-
Francisco J Ayala, et. al.Francisco J Ayala ... Abel Herrera
01 Jan 2010
01 Jan 2010

Performance Evaluation of Mel and Bark Scale based Features for Text-Independent Speaker Identification
Dr S B Dhonde ... Amol A Chaudhari
International Journal of Innovative Technology and Exploring Engineering | VOL. 8
Dr S B Dhonde, et. al.Dr S B Dhonde ... Amol A Chaudhari
30 Sep 2019
International Journal of Innovative Technology and Exploring Engineering | VOL. 8

Robust emotion recognition from speech: Gamma tone features and models
A Revathi ... R Nagakrishnan
International Journal of Speech Technology | VOL. 21
A Revathi, et. al.A Revathi ... R Nagakrishnan
04 Aug 2018
International Journal of Speech Technology | VOL. 21

A novel approach in feature level for robust text-independent speaker identification system
Susanta Kumar Sarangi ... Goutam Saha
-
Susanta Kumar Sarangi, et. al.Susanta Kumar Sarangi ... Goutam Saha
01 Dec 2012
01 Dec 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Frequency bark cepstral coefficients extraction for speech analysis by synthesis.

Abstract

Talk to us

Similar Papers

More From: The Journal of the Acoustical Society of America