Acoustic modeling for Kazakh speech synthesis

A.K Kaliyev,S.V Rybin

doi:10.17586/2226-1494-2019-19-5-951-954

Acoustic modeling for Kazakh speech synthesis

A.K Kaliyev, S.V Rybin

Open Access

PDF Available

https://doi.org/10.17586/2226-1494-2019-19-5-951-954

Copy DOI

Export

Save

Cite

Journal: Scientific and Technical Journal of Information Technologies, Mechanics and Optics	Publication Date: Oct 1, 2019
Citations: 1	License type: cc-by-nc

#Framework Of Generative Adversarial Network #Speech Synthesis #Model For Speech Synthesis #Kazakh Speech #Mean Opinion Score #Generative Adversarial Network #Linguistic Representation #Approach Of Model Development #Acoustic Model #Framework Of Network

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

We present a new framework of generative adversarial network for training of acoustic model for speech synthesis. The proposed generative adversarial network consists of a generator and a pair of agent discriminators, where the generator predicts the acoustic features from the linguistic representation. Training and testing were carried out on the Kazakh speech corpus, which consisted of 5.6 hours of speech recording. According to the experiment results the 3.46 mean opinion score was obtained which shows an acceptable quality of speech synthesis. This approach of the acoustic model development can be applied in speech synthesis systems of the other languages.

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Scientific and Technical Journal of Information Technologies, Mechanics and Optics

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.

R Discovery Prime

Acoustic modeling for Kazakh speech synthesis