A subspace based progressive coding method for speech compression

Serkan Keser, Ömer Nezih Gerek, Erol Seke, Mehmet Bilginer Gülmezoğlu

doi:10.1016/j.specom.2017.09.002

Abstract

In this study, two novel methods, based on the Karhunen-Loève Transform (KLT) and Independent Component Analysis (ICA), are proposed for coding speech signals. Rather than working directly with eigenvalue magnitudes, the KLT- and ICA-based methods take the eigenvectors of covariance matrices (or the independent components, for ICA) and geometrically group them into a smaller number of vectors, which compacts the data representation. Further compression is achieved by discarding the autocovariance eigenvectors that correspond to small eigenvalues and applying vector quantization to the remaining eigenvectors. Additionally, this study proposes an iterative error refinement process, which uses the rest of the available bandwidth to transmit an efficient representation of the description error for better SNR. The overall process constitutes a new approach to efficient speech coding, with ICA being used in subspace speech coding for the first time. Constant bit rate (CBR) and variable bit rate (VBR) coding algorithms are employed with the proposed methods. The TIMIT speech database is used in the experimental studies. Speech signals are synthesized at 2.4 kbps, 8 kbps, 12.2 kbps, 16 kbps, 16.4 kbps and 19.85 kbps using various frame lengths. The quality of the synthesized speech is compared with that of available speech codecs, i.e., LPC (2.4 kbps), G.728 (LD-CELP, 16 kbps), G.729A (CS-CELP, 8 kbps), EVS (16.4 kbps), AMR-NB (12.2 kbps) and AMR-WB (19.85 kbps).
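The subspace idea in the abstract (keep the covariance eigenvectors with large eigenvalues, discard the rest) can be illustrated with a plain truncated KLT. The following is only a minimal sketch under stated assumptions, not the authors' method: it omits the geometric grouping of eigenvectors, the vector quantization, the ICA variant and the iterative error refinement. The synthetic test signal, the frame length of 160 samples and the helper names (frame_signal, klt_basis, klt_code, snr_db) are all hypothetical choices made for illustration.

```python
import numpy as np


def frame_signal(x, frame_len):
    """Split a 1-D signal into non-overlapping frames (one per row); drop the tail."""
    n_frames = len(x) // frame_len
    return x[: n_frames * frame_len].reshape(n_frames, frame_len)


def klt_basis(frames):
    """Return the frame mean and the covariance eigenpairs, largest eigenvalue first."""
    mean = frames.mean(axis=0)
    cov = np.cov(frames - mean, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)  # eigh returns ascending eigenvalues
    order = np.argsort(eigvals)[::-1]
    return mean, eigvals[order], eigvecs[:, order]


def klt_code(frames, mean, eigvecs, k):
    """Project frames onto the k leading eigenvectors, then reconstruct."""
    basis = eigvecs[:, :k]
    coeffs = (frames - mean) @ basis   # k coefficients per frame
    recon = coeffs @ basis.T + mean    # approximation in the k-dim subspace
    return coeffs, recon


def snr_db(ref, est):
    """Signal-to-noise ratio of the reconstruction, in dB."""
    noise = ref - est
    return 10.0 * np.log10(np.sum(ref ** 2) / np.sum(noise ** 2))


# Assumption: a synthetic two-tone signal with noise stands in for speech.
rng = np.random.default_rng(0)
t = np.arange(16000) / 16000.0
x = (np.sin(2 * np.pi * 210 * t)
     + 0.5 * np.sin(2 * np.pi * 450 * t)
     + 0.05 * rng.standard_normal(t.size))

frames = frame_signal(x, 160)
mean, eigvals, eigvecs = klt_basis(frames)

for k in (2, 4, 8):
    _, recon = klt_code(frames, mean, eigvecs, k)
    print(f"k={k}: SNR = {snr_db(frames, recon):.2f} dB")
```

Keeping more eigenvectors can only lower the reconstruction error, because each subspace contains the previous one. In a real coder the retained coefficients and basis vectors would still have to be quantized to meet a target bit rate, which this sketch does not attempt.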


Journal: Speech Communication
Publication Date: Sep 13, 2017
Citations: 7

Similar Papers

Continuous variable sampling rate, application on speech
S Elramly ... M El-Shafie
01 Jul 1997

Quality-optimised MPEG2 video data rate control using fuzzy logic techniques
Y.-S Saw ... P.M Grant
IEE Proceedings - Vision, Image, and Signal Processing | VOL. 145
01 Jan 1998

An end-to-end performance comparison methodology for ATM transport of CBR and VBR encoded digital video
Ragip Kurçeren ... Jim W Modestino
European Transactions on Telecommunications | VOL. 12
01 May 2001

Compression of Multicomponent Satellite Images Using Independent Components Analysis
Isidore Paul Akam Bita ... Dinh-Tuan Antoine Pham
01 Jan 2006
