Abstract
PurposeThis paper aims to disentangle Chinese-English-rich resources linguistic and speaker timbre features, achieving cross-lingual speaker transfer for Cambodian.Design/methodology/approachThis study introduces a novel approach: the construction of a cross-lingual feature disentangler coupled with the integration of time-frequency attention adaptive normalization to proficiently convert Cambodian speaker timbre into Chinese-English without altering the underlying Cambodian speech content.FindingsConsidering the limited availability of multi-speaker corpora in Cambodia, conventional methods have demonstrated subpar performance in Cambodian speaker voice transfer.Originality/valueThe originality of this study lies in the effectiveness of the disentanglement process and precise control over speaker timbre feature transfer.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have