Research on text analysis for Tibetan statistical parametric speech synthesis

Zhenye Gan,Shuai Zhang,Xinjie Kong

doi:10.1109/cisp-bmei.2016.7852834

Abstract

Text analysis is the front-end of a TTS system, which has a great influence on the naturalness of the back-end speech synthesis. Statistical parametric speech synthesis is being commonly applied into speech synthesis now, and gradually becoming an important method of the current speech synthesis, however, the research of front-end text analysis is often overlooked in the process of current Tibetan speech synthesis, so the research of Tibetan text analysis is still staying in an initial stage. The research of this paper is faced on the Tibetan statistical parametric speech synthesis system, the mainly work of us is to have a text analysis for the input Tibetan text, and the aim is to acquire the mono-phone labeling information and context-dependent labeling information, which is needed by the back-end speech synthesis. Finally, we have a speech synthesis to the Tibetan text with the statistical parametric information that we have obtained through the process of text analysis, then the quality of synthetic speech is evaluated by MOS evaluation, we choose randomly 50 sentences to apply to the process of MOS evaluation, through the experiment, the score of MOS evaluation can reach 4.0 points, so it indicated that the naturalness and intelligibility of synthetic speech is good, and the method of this paper is effective.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Research on text analysis for Tibetan statistical parametric speech synthesis

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Constructing a Deep Neural Network Based Spectral Model for Statistical Speech Synthesis
Shinji Takaki ... Junichi Yamagishi
-
Shinji Takaki, et. al.Shinji Takaki ... Junichi Yamagishi
01 Jan 2015
01 Jan 2015

Multi-speaker modeling with shared prior distributions and model structures for Bayesian speech synthesis
Kei Hashimoto ... Yoshihiko Nankaku
-
Kei Hashimoto, et. al.Kei Hashimoto ... Yoshihiko Nankaku
27 Aug 2011
27 Aug 2011

Measuring the Quality of Low-Resourced Statistical Parametric Speech Synthesis Trained with Noise-Degraded Data Supported by the University of Costa Rica
Marvin Coto-Jiménez
Computación y Sistemas | VOL. 26
Marvin Coto-JiménezMarvin Coto-Jiménez
30 Jun 2022
Computación y Sistemas | VOL. 26

Postfilters to Modify the Modulation Spectrum for Statistical Parametric Speech Synthesis
Shinnosuke Takamichi ... Graham Neubig
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 24
Shinnosuke Takamichi, et. al.Shinnosuke Takamichi ... Graham Neubig
01 Apr 2016
IEEE/ACM Transactions on Audio, Speech, and Language Processing | VOL. 24

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Research on text analysis for Tibetan statistical parametric speech synthesis

Abstract

Talk to us

Similar Papers