Адаптація фреймворку WORLD для пофреймового аналізу мовлення в реальному часі

Eugene Koshel

doi:10.34185/1562-9945-5-148-2023-03

Адаптація фреймворку WORLD для пофреймового аналізу мовлення в реальному часі

Eugene Koshel

Open Access

https://doi.org/10.34185/1562-9945-5-148-2023-03

Copy DOI

Journal: System technologies	Publication Date: Mar 20, 2024
License type: CC BY 4.0

#Synthetic Signals #Speech Synthesis System + Show 2 more

Abstract
Full-Text PDF
Similar Papers

Abstract

WORLD is a vocoder-based speech synthesis system developed by M. Morise et al. and implemented in C++. It was demonstrated to have improved performance and accuracy when compared to other algorithms. However, it turned out to not perform well in certain scenarios, particularly, when applying the framework to very short waveforms on a frame-by-frame basis. This paper reviews the issues of the C++ implementation of WORLD and pro-poses modified versions of its constituting algorithms that attempt to mitigate those issues. The resulting framework is tested on both synthetic signals and on real recorded speech.

Full Text