Deep Learning Enabled Semantic Communications With Speech Recognition and Synthesis

Zhenzi Weng,Zhijin Qin,Geoffrey Ye Li,Guangyi Liu,Chengkang Pan,Xiaoming Tao

doi:10.1109/twc.2023.3240969

Abstract

In this paper, we develop a deep learning based semantic communication system for speech transmission, named DeepSC-ST. We take the speech recognition and speech synthesis as the transmission tasks of the communication system, respectively. First, the speech recognition-related semantic features are extracted for transmission by a joint semantic-channel encoder and the text is recovered at the receiver based on the received semantic features, which significantly reduces the required amount of data transmission without performance degradation. Then, we perform speech synthesis at the receiver, which dedicates to re-generate the speech signals by feeding the recognized text and the speaker information into a neural network module. To enable the DeepSC-ST adaptive to dynamic channel environments, we identify a robust model to cope with different channel conditions. According to the simulation results, the proposed DeepSC-ST significantly outperforms conventional communication systems and existing DL-enabled communication systems, especially in the low signal-to-noise ratio (SNR) regime. A software demonstration is further developed as a proof-of-concept of the DeepSC-ST.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE Transactions on Wireless Communications	Publication Date: Sep 1, 2023
Citations: 47	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Deep Learning Enabled Semantic Communications With Speech Recognition and Synthesis

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Wireless Communications

Lead the way for us

Similar Papers

Semantic Communications for Speech Recognition
Zhenzi Weng ... Zhijin Qin
-
Zhenzi Weng, et. al.Zhenzi Weng ... Zhijin Qin
01 Dec 2021
01 Dec 2021

A Bilingual Kazakh-Russian System for Automatic Speech Recognition and Synthesis
Olga Khomitsevich ... Valentin Mendelev
-
Olga Khomitsevich, et. al.Olga Khomitsevich ... Valentin Mendelev
01 Jan 2015
01 Jan 2015

New Systems and Architectures for Automatic Speech Recognition and Synthesis
Ching Y Suen
-
Ching Y SuenChing Y Suen
01 Jan 1985
01 Jan 1985

Impact Analysis of Emerging Semantic Communication Systems on Network Performance
Harim Lee ... Hyeongtae Ahn
Electronics | VOL. 11
Harim Lee, et. al.Harim Lee ... Hyeongtae Ahn
13 May 2022
Electronics | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep Learning Enabled Semantic Communications With Speech Recognition and Synthesis

Abstract

Talk to us

Similar Papers

More From: IEEE Transactions on Wireless Communications