Learning Voice Representation Using Knowledge Distillation for Automatic Voice Casting

Adrien Gresse,Richard Dufour,Jean-François Bonastre,Mathias Quillot

doi:10.21437/interspeech.2020-2236

Abstract

The search for professional voice-actors for audiovisual productions is a sensitive task, performed by the artistic directors (ADs). The ADs have a strong appetite for new talents/voices but cannot perform large scale auditions. Automatic tools able to suggest the most suited voices are of a great interest for audiovisual industry. In previous works, we showed the existence of acoustic information allowing to mimic the AD's choices. However, the only available information is the ADs' choices from the already dubbed multimedia productions. In this paper, we propose a representation-learning based strategy to build a character/role representation, called p-vector. In addition, the large variability between audiovisual productions makes difficult to have homogeneous training datasets. We overcome this difficulty by using knowledge distillation methods to take advantage of external datasets. Experiments are conducted on video-game voice excerpts. Results show a significant improvement using the p-vector, compared to the speaker-based x-vectors representation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learning Voice Representation Using Knowledge Distillation for Automatic Voice Casting

Abstract

Talk to us

Similar Papers

Lead the way for us

Publication Date: Oct 25, 2020
Citations: 2	License type: other-oa

Similar Papers

Multimedia and the creation of the scenographic space in the stage realization of Emmanuel Emmasealu’s nerves
Kenneth Efakponana Eni
AFRREV IJAH: An International Journal of Arts and Humanities | VOL. 7
Kenneth Efakponana EniKenneth Efakponana Eni
16 Jul 2018
AFRREV IJAH: An International Journal of Arts and Humanities | VOL. 7

Discretization and decoupled knowledge distillation for arbitrary oriented object detection
Cheng Chen ... Hongwei Ding
Digital Signal Processing | VOL. 150
Cheng Chen, et. al.Cheng Chen ... Hongwei Ding
17 Apr 2024
Digital Signal Processing | VOL. 150

Investigation of Sequence-level Knowledge Distillation Methods for CTC Acoustic Models
Ryoichi Takashima ... Hisashi Kawai
-
Ryoichi Takashima, et. al.Ryoichi Takashima ... Hisashi Kawai
01 May 2019
01 May 2019

Multi-perspective analysis on data augmentation in knowledge distillation
Wei Li ... Aiguo Song
Neurocomputing | VOL. 583
Wei Li, et. al.Wei Li ... Aiguo Song
05 Mar 2024
Neurocomputing | VOL. 583

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning Voice Representation Using Knowledge Distillation for Automatic Voice Casting

Abstract

Talk to us

Similar Papers