A Multi-Frame PCA-Based Stereo Audio Coding Method

Jing Wang,Xiaohan Zhao,Xiang Xie,Jingming Kuang

doi:10.3390/app8060967

Jing Wang, Xiaohan Zhao + Show 2 more

Open Access

https://doi.org/10.3390/app8060967

Copy DOI

Journal: Applied Sciences	Publication Date: Jun 12, 2018
Citations: 5	License type: CC BY 4.0

Affiliation: Beijing Institute of Technology

Abstract

With the increasing demand for high quality audio, stereo audio coding has become more and more important. In this paper, a multi-frame coding method based on Principal Component Analysis (PCA) is proposed for the compression of audio signals, including both mono and stereo signals. The PCA-based method makes the input audio spectral coefficients into eigenvectors of covariance matrices and reduces coding bitrate by grouping such eigenvectors into fewer number of vectors. The multi-frame joint technique makes the PCA-based method more efficient and feasible. This paper also proposes a quantization method that utilizes Pyramid Vector Quantization (PVQ) to quantize the PCA matrices proposed in this paper with few bits. Parametric coding algorithms are also employed with PCA to ensure the high efficiency of the proposed audio codec. Subjective listening tests with Multiple Stimuli with Hidden Reference and Anchor (MUSHRA) have shown that the proposed PCA-based coding method is efficient at processing stereo audio.

Highlights

The goal of audio coding is to represent audio in digital form with as few bits as possible while maintaining the intelligibility and quality required for particular applications [1]
It is very important to deal with the stereo signal efficiently, which can offer better experiences of using applications like mobile communication and live audio broadcasting
Intensity stereo is supported by many audio compression formats such as Advanced Audio Coding (AAC) [5,6], which is used for the transfer of relatively low bit rate, acceptable-quality audio with modest internet access speed

Summary

Introduction

The goal of audio coding is to represent audio in digital form with as few bits as possible while maintaining the intelligibility and quality required for particular applications [1]. It is very important to deal with the stereo signal efficiently, which can offer better experiences of using applications like mobile communication and live audio broadcasting. Over these years, a variety of techniques for stereo signal processing have been proposed [2,3], including M/S stereo, intensity stereo, joint stereo, and parametric stereo. Intensity stereo works on the principle of sound localization [4]: humans have a less keen sense of perceiving the direction of certain audio frequencies By exploiting this characteristic, intensity stereo coding can reduce the bitrate with little or no perceived change in apparent quality. The idea behind parametric stereo coding is to maximize the compression of a stereo signal by transmitting parameters describing the spatial image

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Multi-Frame PCA-Based Stereo Audio Coding Method

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences

Lead the way for us

Similar Papers

An embedded stereo speech and audio coding method based on principal component analysis
Mao-Shen Jia ... Xin Liu
-
Mao-Shen Jia, et. al.Mao-Shen Jia ... Xin Liu
01 Dec 2011
01 Dec 2011

Tensor completion for recovering multichannel audio signal with missing data
...
China Communications | VOL. 16
, et. al. ...
22 Apr 2019
China Communications | VOL. 16

Comparing glottal-flow-excited statistical parametric speech synthesis methods
Tuomo Raitio ... Paavo Alku
-
Tuomo Raitio, et. al.Tuomo Raitio ... Paavo Alku
01 May 2013
01 May 2013

Wavelet-based corner detection using eigenvectors of covariance matrices
Chi-Hao Yeh
Pattern Recognition Letters | VOL. 24
Chi-Hao YehChi-Hao Yeh
25 Jun 2003
Pattern Recognition Letters | VOL. 24

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Multi-Frame PCA-Based Stereo Audio Coding Method

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied Sciences