A Variable Break Prediction Method Using CART in a Japanese Text-to-Speech System

Deok-Su Na,Myung-Jin Bae

doi:10.1587/transinf.e92.d.349

Deok-Su Na, Myung-Jin Bae

Open Access

https://doi.org/10.1587/transinf.e92.d.349

Copy DOI

Abstract

Break prediction is an important step in text-to-speech systems as break indices (BIs) have a great influence on how to correctly represent prosodic phrase boundaries. However, an accurate prediction is difficult since BIs are often chosen according to the meaning of a sentence or the reading style of the speaker. In Japanese, the prediction of an accentual phrase boundary (APB) and major phrase boundary (MPB) is particularly difficult. Thus, this paper presents a method to complement the prediction errors of an APB and MPB. First, we define a subtle BI in which it is difficult to decide between an APB and MPB clearly as a variable break (VB), and an explicit BI as a fixed break (FB). The VB is chosen using the classification and regression tree, and multiple prosodic targets in relation to the pith and duration are then generated. Finally, unit-selection is conducted using multiple prosodic targets. The experimental results show that the proposed method improves the naturalness of synthesized speech.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEICE Transactions on Information and Systems	Publication Date: Jan 1, 2009
Citations: 4	License type: free

R Discovery Prime

R Discovery Prime

A Variable Break Prediction Method Using CART in a Japanese Text-to-Speech System

Abstract

Talk to us

Similar Papers

More From: IEICE Transactions on Information and Systems

Lead the way for us

Similar Papers

Focus Marking and Prosodic Boundary Strength in French
Amandine Michelas ... James S. German
Phonetica | VOL. 77
Amandine Michelas, et. al.Amandine Michelas ... James S. German
01 Jul 2019
Phonetica | VOL. 77

Break Index (BI) annotated speech corpus for Urdu TTS
Benazir Mumtaz ... Sarmad Hussain
-
Benazir Mumtaz, et. al.Benazir Mumtaz ... Sarmad Hussain
01 Oct 2016
01 Oct 2016

Prosodic Modifications of the Internal Phonetic Structure of Monosyllabic CVC Words in Conversational Speech
Yoonsook Mo
Phonetics and Speech Sciences | VOL. 5
Yoonsook MoYoonsook Mo
31 Mar 2013
Phonetics and Speech Sciences | VOL. 5

An exploration into Penultimate and Final Lengthening in Tswana (Southern Bantu)
Fabian Schubö ... Sabine Zerbian
- Stellenbosch Papers in Linguistics Plus | VOL. 62
Fabian Schubö, et. al.Fabian Schubö ... Sabine Zerbian
01 Aug 2021
- Stellenbosch Papers in Linguistics Plus | VOL. 62

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Variable Break Prediction Method Using CART in a Japanese Text-to-Speech System

Abstract

Talk to us

Similar Papers

More From: IEICE Transactions on Information and Systems