Weight optimization for bimodal unit-selection talking head synthesis

Asterios Toutios,Utpala Musti,Vincent Colotte,Slim Ouni

doi:10.21437/interspeech.2011-598

Weight optimization for bimodal unit-selection talking head synthesis

Asterios Toutios, Utpala Musti + Show 2 more

https://doi.org/10.21437/interspeech.2011-598

Copy DOI

Publication Date: Aug 27, 2011

#Audiovisual Speech Synthesis #Talking Head + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

This paper addresses talking head synthesis based on the concatenation of units comprising of both acoustic and visual information. Selection of appropriate diphone units to synthesize ag iven text string is based on the minimization of aw eighted linear combination of four costs that reflect linguistic, acoustic, and visual considerations. We present initial work toward a method to determine automatically the weights applied to each cost, using a series of metrics that assess quantitatively the performance of synthesis. Index Terms :t alking head, audiovisual speech synthesis, selection, optimization

Full Text