FT-GAN: Fine-Grained Tune Modeling for Chinese Opera Synthesis

Meizhen Zheng,Yiting Yan,Xun Zhou,Peng Bai,Xiaodong Shi

doi:10.1609/aaai.v38i17.29943

Abstract

Although singing voice synthesis (SVS) has made significant progress recently, with its unique styles and various genres, Chinese opera synthesis requires greater attention but is rarely studied for lack of training data and high expressiveness. In this work, we build a high-quality Gezi Opera (a type of Chinese opera popular in Fujian and Taiwan) audio-text alignment dataset and formulate specific data annotation methods applicable to Chinese operas. We propose FT-GAN, an acoustic model for fine-grained tune modeling in Chinese opera synthesis based on the empirical analysis of the differences between Chinese operas and pop songs. To further improve the quality of the synthesized opera, we propose a speech pre-training strategy for additional knowledge injection. The experimental results show that FT-GAN outperforms the strong baselines in SVS on the Gezi Opera synthesis task. Extensive experiments further verify that FT-GAN performs well on synthesis tasks of other operas such as Peking Opera. Audio samples, the dataset, and the codes are available at https://zhengmidon.github.io/FTGAN.github.io/.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

FT-GAN: Fine-Grained Tune Modeling for Chinese Opera Synthesis

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Similar Papers

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
Jinglin Liu ... Chengxi Li
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 36
Jinglin Liu, et. al.Jinglin Liu ... Chengxi Li
28 Jun 2022
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 36

SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation
Rongjie Huang ... Zhou Zhao
-
Rongjie Huang, et. al.Rongjie Huang ... Zhou Zhao
10 Oct 2022
10 Oct 2022

An HMM-based singing voice synthesis system
Keijiro Saino ... Yoshihiko Nankaku
-
Keijiro Saino, et. al.Keijiro Saino ... Yoshihiko Nankaku
17 Sep 2006
17 Sep 2006

HiddenSinger: High-quality singing voice synthesis via neural audio codec and latent diffusion models
Ji-Sang Hwang ... Seong-Whan Lee
Neural Networks | VOL. 181
Ji-Sang Hwang, et. al.Ji-Sang Hwang ... Seong-Whan Lee
27 Sep 2024
Neural Networks | VOL. 181

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

FT-GAN: Fine-Grained Tune Modeling for Chinese Opera Synthesis

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence