Abstract

At the faculty of informatics, Masaryk university, Brno we developed the AUDIS system recently. Description of the system can be found in P. Gaura (2000), P. Gaura, P. Nygryn (2000) and I. Kopecek (1998). AUDIS is developed primarily as a multimodal support that would help visually impaired students to study various materials. For proper functionality of the system inputs and outputs, we need also high quality speech synthesis. Unfortunately, it is not available for Czech language. Therefore, we are developing a speech engine that allows us to produce high quality Czech speech for some limited domains together with the average quality of general Czech speech synthesis (where average means well comprehensible). Limited domain speech synthesis will be used for frequently used speech outputs (e.g. navigation in a document, control of the system), while general speech synthesis will be available for common text. For these purposes we have developed the automatic recording system that allows us to collect and process the large amount of speech data. The basic principles of our speech synthesis, the recording system and the speech segments selection and processing are described in the first part of the paper. The second part of the paper deals with methods for choosing the best set of speech data to be recorded into the corpus and the speech data segmentation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.