Abstract
To address the lack of emotional speech databases with rich emotion annotation, a Chinese dual-mode emotional speech database containing both speech and electroglottography (EGG) signals was established, and annotation and consistency detection were carried out on it. First, we designed detailed annotation rules and methods according to the characteristics of the emotional speech database and selected 5 annotators to label the database in accordance with these rules. Second, to ensure annotation quality and to test the completeness of the annotation rules, the annotators labeled a subset of the utterances as a trial before the official annotation; the test material comprised 280 sentences (seven emotions × two actors × twenty sentences). Finally, we designed a consistency detection algorithm that corresponds to the speech annotation rules. The results show that, within a time error range of 5 ms, the annotation consistency for the same utterances labeled by the 5 annotators exceeds 60% on average. When the time error range is widened to 8 ms and 10 ms, consistency increases by 5% and 8% on average, respectively. The experiment indicates that the 5 annotators understood the speech relatively consistently, that the annotation rules we designed are relatively complete, and that the quality of the emotional speech database is relatively high.
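The abstract does not describe the consistency detection algorithm itself, so the following is only a minimal sketch of one plausible tolerance-based check. It assumes each annotator's labels reduce to a list of boundary times in milliseconds. It also assumes that two annotators agree on a boundary when each has a mark within the time error range of the other's. The function names, the pairwise averaging, and the sample timestamps are all hypothetical and are not taken from the paper.

```python
from itertools import combinations


def pairwise_consistency(a, b, tol_ms):
    """Share of boundaries in a and b that have a counterpart in the
    other annotation within tol_ms (assumed agreement criterion)."""
    if not a and not b:
        return 1.0
    matched_a = sum(1 for t in a if any(abs(t - u) <= tol_ms for u in b))
    matched_b = sum(1 for u in b if any(abs(u - t) <= tol_ms for t in a))
    return (matched_a + matched_b) / (len(a) + len(b))


def mean_consistency(annotations, tol_ms):
    """Average pairwise consistency over all annotator pairs."""
    pairs = list(combinations(annotations, 2))
    return sum(pairwise_consistency(a, b, tol_ms) for a, b in pairs) / len(pairs)


# Hypothetical boundary times (ms) for one utterance from three annotators.
example = [
    [100, 250, 400],
    [103, 257, 401],
    [98, 251, 409],
]

for tol in (5, 8, 10):
    print(f"tolerance {tol} ms: {mean_consistency(example, tol):.3f}")
```

With a criterion of this kind, widening the tolerance can only keep or raise the score, which is in line with the reported gains when moving from 5 ms to 8 ms and 10 ms.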
More From: Journal of Beijing University of Aeronautics and Astronautics