CSLNSpeech: Solving the extended speech separation problem with the help of Chinese sign language

Jiasong Wu,Xuan Li,Taotao Li,Fanman Meng,Youyong Kong,Guanyu Yang,Lotfi Senhadji,Huazhong Shu

doi:10.1016/j.specom.2024.103131

Abstract

Previous audio-visual speech separation methods synchronize the speaker's facial movement and speech in the video to self-supervise the speech separation. In this paper, we propose a model to solve the speech separation problem assisted by both face and sign language, which we call the extended speech separation problem. We design a general deep learning network to learn the combination of three modalities, audio, face, and sign language information, to solve the speech separation problem better. We introduce a large-scale dataset named the Chinese Sign Language News Speech (CSLNSpeech) dataset to train the model, in which three modalities coexist: audio, face, and sign language. Experimental results show that the proposed model performs better and is more robust than the usual audio-visual system. In addition, the sign language modality can also be used alone to supervise speech separation tasks, and introducing sign language helps hearing-impaired people learn and communicate. Last, our model is a general speech separation framework and can achieve very competitive separation performance on two open-source audio-visual datasets. The code is available at https://github.com/iveveive/SLNSpeech

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

CSLNSpeech: Solving the extended speech separation problem with the help of Chinese sign language

Abstract

Talk to us

Similar Papers

More From: Speech Communication

Lead the way for us

Similar Papers

Decision letter: Early language exposure affects neural mechanisms of semantic representations
Jamie Reilly ... Floris P de Lange
-
Jamie Reilly, et. al.Jamie Reilly ... Floris P de Lange
23 Jan 2023
23 Jan 2023

Author response: Early language exposure affects neural mechanisms of semantic representations
Xiaosha Wang ... Yanchao Bi
-
Xiaosha Wang, et. al.Xiaosha Wang ... Yanchao Bi
28 Mar 2023
28 Mar 2023

Editor's evaluation: Early language exposure affects neural mechanisms of semantic representations
Jonathan Erik Peelle
-
Jonathan Erik PeelleJonathan Erik Peelle
23 Jan 2023
23 Jan 2023

The Influence of Chinese Characters on Chinese Sign Language
Tianyu Ren ... Xinchen Kang
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 23
Tianyu Ren, et. al.Tianyu Ren ... Xinchen Kang
15 Jan 2024
ACM Transactions on Asian and Low-Resource Language Information Processing | VOL. 23

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

CSLNSpeech: Solving the extended speech separation problem with the help of Chinese sign language

Abstract

Talk to us

Similar Papers

More From: Speech Communication