Abstract

A three-layered prosody control description language is used to insert prosodic feature control commands in a text at the positions of characters or a character string to be added with non-verbal information. The three-layered prosody control description language is composed of: a semantic layer (S layer) having, as its prosodic feature control commands, control commands each represented by a word indicative of the meaning of non-verbal information; an interpretation layer (I layer) having, as its prosodic feature control commands, control commands which interpret the prosodic feature control commands of the S layer and specify control of prosodic parameters of speech; and a parameter layer (P layer) having prosodic parameters which are objects of control by the prosodic feature control commands of the I layer. The text is converted into a prosodic parameter string through synthesis-by-rule. The prosodic parameters corresponding to characters or character string to be corrected are corrected by the prosodic feature control commands of the I layer, and speech is synthesized from a parameter string containing the corrected prosodic parameters.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.