Abstract

We consider the problem of learned speech transmission. Existing methods exploit joint source-channel coding (JSCC) to encode speech directly into transmitted symbols, improving robustness over noisy channels. However, a fundamental limitation of these methods is that they fail to identify the varying content complexity across speech frames, leading to inefficient transmission. In this paper, we propose a novel neural speech transmission framework named NST, which can be optimized for superior rate-distortion-perception (RDP) performance toward the goal of high-fidelity semantic communication. In particular, a learned entropy model assesses latent speech features to quantify semantic content complexity, which enables adaptive transmission rate allocation. NST seamlessly integrates source content with channel state information through variable-length joint source-channel coding, maximizing the coding gain. Furthermore, we present a streaming variant of NST that adopts causal coding based on sliding windows. Experimental results verify that NST outperforms existing speech transmission methods, including separation-based and JSCC solutions, in terms of RDP performance. Streaming NST achieves low-latency transmission with only slight quality degradation, making it well suited for real-time speech communication.
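The abstract's core idea of entropy-driven adaptive rate allocation can be illustrated with a minimal sketch. This is not the authors' code: NST uses a learned entropy model over latent speech features, whereas here the per-frame entropy estimates are simply given as inputs, and the allocation rule (symbols proportional to estimated complexity, with a floor) is a plausible stand-in.

```python
def allocate_symbols(frame_entropies, total_symbols, min_symbols=2):
    """Distribute a channel-symbol budget across speech frames in
    proportion to each frame's estimated content complexity (in bits).

    Hypothetical allocation rule, not taken from the NST paper:
    frames judged more complex by the entropy model get more symbols,
    subject to a small per-frame floor.
    """
    total = sum(frame_entropies)
    return [max(min_symbols, round(e / total * total_symbols))
            for e in frame_entropies]

# Frames with higher estimated entropy receive more channel symbols.
entropies = [1.2, 4.8, 0.6, 3.4]   # hypothetical bits per frame
print(allocate_symbols(entropies, total_symbols=100))
```

A fixed-rate JSCC scheme would instead spend 25 symbols on every frame here, wasting budget on the low-entropy third frame; variable-length allocation is what lets the coder exploit content diversity.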
