Abstract

Entropy coding, which is an essential part of audio compression, is always required to manage the tradeoffs between compression efficiency and computational complexity, and the strategy to achieve them highly depends on the distributions of inputs. In this paper, we present a method of controlling them for enhancing the compression efficiency of Golomb-Rice (GR) encoding, one of the simplest entropy coding methods optimal for Laplacian distributions. We will show that the proposed invertible and low-complexity mapping of integers enables the GR encoding to assign nearly the optimal code length for a wider range of distributions, generalized Gaussian distributions, maintaining low computational cost. A simulation by random numbers reveals that the proposed coder based on this scheme works about 6 times faster than the state-of-the-art arithmetic coder for Gaussian-distributed integers maintaining the increase in relative redundancy around $\text{2.6}{\%}$ , which is much lower than that of a conventional GR coder. Additionally, an application to a practical speech and audio coding scheme is presented, and an objective evaluation for real speech and audio signals confirms the advantages of the proposed method in compression. The method is expected to widen the capability of low-complexity entropy coding, providing us with more flexible codec designs.

Highlights

  • O WING to the recent trends in the Internet of Things (IoT), data transmission is becoming more and more important, and speech and audio data are no exception

  • We focused on the code length for simulated random numbers of GR code combined with the proposed shape control with fixed parameter p and q

  • In this paper we presented an invertible mapping method that can approximately shape generalized-Gaussian-distributed integers into Laplacian-distributed ones, focusing on it use for Golomb-Rice (GR) encoding of frequency spectra

Read more

Summary

Introduction

O WING to the recent trends in the Internet of Things (IoT), data transmission is becoming more and more important, and speech and audio data are no exception These data have been mainly transmitted between humans, sometimes for communication and sometimes for broadcasting. With recent advances in speech recognition, sound detection, and speech synthesis we can expect a rising demand for speech and audio data transmission between humans and between humans and machines. In this context, speech and audio coding, or compression, will play a greater role.

Objectives
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.