A Reinforcement Learning Approach to Speech Coding

Jerry Gibson, Hoontaek Oh

doi:10.3390/info13070331

Abstract

Speech coding is an essential technology for digital cellular communications, voice over IP, and video conferencing systems. For more than 25 years, the main approach to speech coding for these applications has been block-based analysis-by-synthesis linear predictive coding. An alternative approach that has been less successful is sample-by-sample tree coding of speech. We reformulate this latter approach as a multistage reinforcement learning problem with L-step lookahead that incorporates exploration and exploitation to adapt model parameters and to control the speech analysis/synthesis process on a sample-by-sample basis. Minimizing the spectrally shaped reconstruction error to a finite depth manages complexity and serves as an effective stand-in for the overall subjective evaluation of reconstructed speech quality and intelligibility. Different control policies that attempt to persistently excite the system states and that encourage exploration are studied and evaluated. The resulting methods produce reconstructed speech quality competitive with the most popular speech codec in use today. This reinforcement learning formulation provides new insights and opens up new directions for system design and performance improvement.
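
To make the idea of sample-by-sample tree coding with L-step lookahead concrete, the following is a minimal sketch and not the authors' codec. It assumes a fixed first-order predictor (coefficient `a`), a fixed two-level excitation (`step`), and a plain squared error in place of the spectrally shaped error; there is no parameter adaptation or exploration policy. All function and parameter names are hypothetical. At each sample, the encoder searches every excitation path of depth L, releases only the first symbol of the best path, and then advances one sample.

```python
from itertools import product


def decode_step(state, symbol, a=0.9, step=0.1):
    """One synthesis step: predict from the previous output and add the excitation.

    `symbol` is -1 or +1; `a` and `step` are illustrative constants (assumptions).
    """
    return a * state + step * symbol


def path_cost(state, symbols, targets, a=0.9, step=0.1):
    """Accumulated squared error along one candidate path (unweighted, an assumption)."""
    cost = 0.0
    for s, x in zip(symbols, targets):
        state = decode_step(state, s, a, step)
        cost += (x - state) ** 2
    return cost


def tree_encode(samples, L=3, a=0.9, step=0.1):
    """Exhaustive depth-L tree search with single-symbol release per sample."""
    state = 0.0
    symbols, recon = [], []
    for n in range(len(samples)):
        window = samples[n:n + L]  # shorter than L near the end of the input
        # Evaluate all 2**len(window) excitation paths from the current state.
        best = min(
            product((-1, 1), repeat=len(window)),
            key=lambda p: path_cost(state, p, window, a, step),
        )
        # Commit only the first symbol, then slide forward by one sample.
        state = decode_step(state, best[0], a, step)
        symbols.append(best[0])
        recon.append(state)
    return symbols, recon


def tree_decode(symbols, a=0.9, step=0.1):
    """The decoder repeats the same synthesis recursion from the released symbols."""
    state, out = 0.0, []
    for s in symbols:
        state = decode_step(state, s, a, step)
        out.append(state)
    return out
```

Because the decoder runs the same recursion as the encoder, `tree_decode(symbols)` reproduces the encoder's reconstruction exactly. The lookahead depth `L` trades search complexity (here `2**L` paths per sample) against how far ahead each decision is evaluated.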

Journal: Information
Publication Date: Jul 11, 2022
Citations: 3
License type: CC BY 4.0

Similar Papers

Speech Coding Techniques
S.K Jagtap ... M.S Mulye
Procedia Computer Science | VOL. 49
01 Jan 2015

Speech coding research at Bell Laboratories
Bishnu S Atal
The Journal of the Acoustical Society of America | VOL. 115
01 May 2004

Fifty years of progress in speech waveform coding
Bishnu S Atal
The Journal of the Acoustical Society of America | VOL. 116
01 Oct 2004

Predictive coding of speech using microphone/speaker adaptation and vector quantization
A.I Aarskog ... H.C Guren
IEEE Transactions on Speech and Audio Processing | VOL. 2
01 Apr 1994
