Abstract

This paper proposes a variant of decision tree (DT) for HMM-based speech synthesis. We call it Extended Decision Tree with OR Relationship (EDTOR). A leaf node in conventional DT is uniquely reached by answering a series of yes/no questions starting from its root node until the leaf node. Thus the decision condition for deciding whether the acoustic parameters of a context label belong to a certain leaf node is subject to AND logical expressions. However, some linguistic knowledge cannot be represented by AND logical expressions compactly and efficiently. We introduce OR relationship to DT at leaf node level to loosen the restriction on DT. Preliminary experimental results show that EDTOR can, 1) greatly reduce the leaf node number of DT (i.e., model size) without affecting speech synthesis performance, which is appealing to embedded applications, or, 2) slightly improve the performance if DT has the same leaf node number as that of EDTOR.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call