SeDepTTS: Enhancing the Naturalness via Semantic Dependency and Local Convolution for Text-to-Speech Synthesis

Chenglong Jiang,Ying Gao,Hongzhong Zhen,Jinghui Zhong,Jiyong Zhou,Wing W.Y Ng

doi:10.1609/aaai.v37i11.26523

Abstract

Self-attention-based networks have obtained impressive performance in parallel training and global context modeling. However, it is weak in local dependency capturing, especially for data with strong local correlations such as utterances. Therefore, we will mine linguistic information of the original text based on a semantic dependency and the semantic relationship between nodes is regarded as prior knowledge to revise the distribution of self-attention. On the other hand, given the strong correlation between input characters, we introduce a one-dimensional (1-D) convolution neural network (CNN) producing query(Q) and value(V) in the self-attention mechanism for a better fusion of local contextual information. Then, we migrate this variant of the self-attention networks to speech synthesis tasks and propose a non-autoregressive (NAR) neural Text-to-Speech (TTS): SeDepTTS. Experimental results show that our model yields good performance in speech synthesis. Specifically, the proposed method yields significant improvement for the processing of pause, stress, and intonation in speech.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

SeDepTTS: Enhancing the Naturalness via Semantic Dependency and Local Convolution for Text-to-Speech Synthesis

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Jun 26, 2023
Citations: 1

Similar Papers

Semantic dependency and local convolution for enhancing naturalness and tone in text-to-speech synthesis
Chenglong Jiang ... Xiping Hu
Neurocomputing | VOL. 608
Chenglong Jiang, et. al.Chenglong Jiang ... Xiping Hu
22 Aug 2024
Neurocomputing | VOL. 608

Fault Diagnosis of Rotating Machinery Based on 1D–2D Joint Convolution Neural Network
Wenliao Du ... Xiaoyun Gong
IEEE Transactions on Industrial Electronics | VOL. 70
Wenliao Du, et. al.Wenliao Du ... Xiaoyun Gong
01 May 2023
IEEE Transactions on Industrial Electronics | VOL. 70

A motor bearing fault diagnosis method based on multi-source data and one-dimensional lightweight convolution neural network
Yifan Dong ... Zheng Wang
Proceedings of the Institution of Mechanical Engineers, Part I: Journal of Systems and Control Engineering | VOL. 237
Yifan Dong, et. al.Yifan Dong ... Zheng Wang
07 Oct 2022
Proceedings of the Institution of Mechanical Engineers, Part I: Journal of Systems and Control Engineering | VOL. 237

Traffic State Prediction Using One-Dimensional Convolution Neural Networks and Long Short-Term Memory
Selim Reza ... Marta Campos Ferreira
Applied Sciences | VOL. 12
Selim Reza, et. al.Selim Reza ... Marta Campos Ferreira
19 May 2022
Applied Sciences | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

SeDepTTS: Enhancing the Naturalness via Semantic Dependency and Local Convolution for Text-to-Speech Synthesis

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence