Research on speech style transfer algorithm combined with image processing perspective

Yuanqi Chen

doi:10.54097/fcis.v3i1.6032

Abstract

Speech, the acoustic expression of language, is one of the most natural and effective means of human communication. With the rapid development of the Internet and communication technology, voice interaction with robots has become increasingly popular. However, robotic-sounding pronunciation cannot meet the demand for personalized voice interaction. Meanwhile, style transfer technology, which is widely used in image and video processing, is already relatively mature. This paper studies the theoretical methods of style transfer in the general sense (including style transfer for images and video signals) and compares the machine learning algorithms used in current voice style transfer technology. It draws the following conclusions: first, the models generally require large amounts of training data and are difficult to train; second, the same model performs inconsistently across different usage scenarios. Finally, based on these problems, the paper offers suggestions for the development of voice style transfer.

Journal: Frontiers in Computing and Intelligent Systems
Publication Date: Mar 17, 2023
License type: CC BY 4.0

Similar Papers

A Layered Algorithm of Style Transfer
Qi Lin ... Qing Zhu
01 Oct 2022

An Attribute-enhanced Method based on Latent Space Factorization for Face Image Style Transfer
Tingyan Gu ... Wenjun Zhang
07 Apr 2023

Effective writing style transfer via combinatorial paraphrasing
Tommi Gröndahl ... N Asokan
Proceedings on Privacy Enhancing Technologies | VOL. 2020
17 Aug 2020

Style transfer with VGG19
Langtian Lang
Applied and Computational Engineering | VOL. 6
14 Jun 2023
