Speech reconstruction for MFCC-based low bit-rate speech coding

Jiang Wenbin, Liu Peilin, Ying Rendong

doi:10.1109/icmew.2014.6890586

Abstract

Speech reconstruction is a key issue in speech coding. In this paper, we propose an extended least-squares estimate inverse short-time Fourier transform magnitude (LSE-ISTFTM) speech reconstruction algorithm for MFCC-based low bit-rate speech coding. The proposed extended LSE-ISTFTM algorithm initializes the speech with a specific signal rather than white noise, and reconstructs voiced and unvoiced frames separately. Pitch frequency and voicing class are estimated with a Gaussian mixture model (GMM) from the magnitude spectrum, which is obtained by inverting the MFCCs. The voicing classification and pitch estimation results show errors lower than 1% and 5.62%, respectively. The speech reconstruction results demonstrate that the proposed extended LSE-ISTFTM algorithm is more stable and converges faster than the original LSE-ISTFTM algorithm. The speech coding results also show that the proposed algorithm achieves higher speech quality than the classic algorithm.
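
For readers unfamiliar with the baseline, the following is a minimal sketch of the classic LSE-ISTFTM iteration (the Griffin-Lim procedure) that the abstract builds on. It is not the authors' extended algorithm: the separate handling of voiced and unvoiced frames and the GMM-based pitch estimation are omitted, and the "specific signal" used for initialization is not given in the abstract. The function names, the optional `init` argument, and the `spectral_error` helper are assumptions made for illustration; `init` only marks where a non-noise starting signal could be supplied.

```python
import numpy as np

def stft(x, win, hop):
    """Short-time Fourier transform: one full FFT per windowed frame."""
    n = len(win)
    starts = range(0, len(x) - n + 1, hop)
    return np.array([np.fft.fft(win * x[s:s + n]) for s in starts])

def lse_istft(spec, win, hop, length):
    """Least-squares inverse STFT via window-weighted overlap-add."""
    n = len(win)
    num = np.zeros(length)
    den = np.zeros(length)
    for i, frame in enumerate(spec):
        s = i * hop
        num[s:s + n] += win * np.fft.ifft(frame).real
        den[s:s + n] += win ** 2
    # Samples never covered by a non-zero window weight are left at zero.
    return num / np.maximum(den, 1e-12)

def lse_istftm(mag, win, hop, length, n_iter=100, init=None, seed=0):
    """Estimate a signal whose STFT magnitude approximates `mag`.

    `init` is a hypothetical hook for a non-noise starting signal;
    when it is None, white noise is used as in the classic algorithm.
    """
    if init is None:
        init = np.random.default_rng(seed).standard_normal(length)
    x = np.asarray(init, dtype=float)
    for _ in range(n_iter):
        spec = stft(x, win, hop)
        phase = np.exp(1j * np.angle(spec))           # keep the current phase
        x = lse_istft(mag * phase, win, hop, length)  # impose target magnitude
    return x

def spectral_error(x, mag, win, hop):
    """Normalized squared distance between |STFT(x)| and the target magnitude."""
    return np.sum((np.abs(stft(x, win, hop)) - mag) ** 2) / np.sum(mag ** 2)
```

Each pass keeps the phase of the current estimate, replaces its magnitude with the target, and resynthesizes by least squares, so the spectral error does not increase from one iteration to the next. How quickly it falls depends on the starting signal, which is the aspect the abstract says the extended algorithm addresses.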

