Neurally Optimized Decoder for Low Bitrate Speech Codec

Hyung Yong Kim,Won Ik Cho,Nam Soo Kim,Ji Won Yoon

doi:10.1109/lsp.2021.3132557

Abstract

Recently, a conventional neural decoder for speech codec has shown promising performance. However, it typically requires some prior knowledge of decoding such as bit allocation or dequantization scheme, which is not a universal solution for many different kinds of speech codecs. In order to address this limitation, we propose a neurally optimized decoder based on a generative model which can directly reconstruct the speech from the bitstream without a prior knowledge. The proposed decoder mainly consists of two components: 1) a dequantization model to group and dequantize related bits from the bitstream and 2) a generative model to restore the speech conditioned on the output of the dequantization model. Through experiments with mixed excitation linear prediction (MELP), Advanced multi-band excitation (AMBE), and SPEEX at around 2.4 kb/s, it is showed that the proposed model showed better performance in most of the objective and subjective evaluation compared to the conventional speech codecs.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Neurally Optimized Decoder for Low Bitrate Speech Codec

Abstract

Talk to us

Similar Papers

More From: IEEE Signal Processing Letters

Lead the way for us

Journal: IEEE Signal Processing Letters	Publication Date: Jan 1, 2022
Citations: 2

Similar Papers

Robust Transmission of Multistage Vector Quantized Sources Over Noisy Communication Channels—Applications to MELP Speech Codec
Farshad Lahouti ... Amir K Khandani
IEEE Transactions on Vehicular Technology | VOL. 55
Farshad Lahouti, et. al.Farshad Lahouti ... Amir K Khandani
01 Nov 2006
IEEE Transactions on Vehicular Technology | VOL. 55

A low bit rate speech codec using mixed excitation linear prediction for private mobile radio
Seishi Sasaki ... Teruo Fumoto
Electronics and Communications in Japan (Part II: Electronics) | VOL. 87
Seishi Sasaki, et. al.Seishi Sasaki ... Teruo Fumoto
20 May 2004
Electronics and Communications in Japan (Part II: Electronics) | VOL. 87

High quality MELP coding at bit-rates around 4 kb/s
J Stachurski ... A Mccree
-
J Stachurski, et. al.J Stachurski ... A Mccree
01 Jan 1998
01 Jan 1998

An improved mixed excitation linear prediction (MELP) coder
T Unno ... Kwan Truong
-
T Unno, et. al.T Unno ... Kwan Truong
01 Jan 1998
01 Jan 1998

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Neurally Optimized Decoder for Low Bitrate Speech Codec

Abstract

Talk to us

Similar Papers

More From: IEEE Signal Processing Letters