Changes in intonation patterns may convey not only different meaning, but also different emotions even if the sequence of speech segments is the same in a sentence. The patterns change depending upon the structure and emotion of the sentence and require being stored in speech database. It is a difficult and time-consuming task to store all utterances of all the expressive style, which also consumes huge memory space. Therefore, there should be an approach that minimizes the time and memory space for emotion-rich database. A number of studies in this respect have been carried out for several languages and models developed. However, for Hindi not many studies have been carried out. Taking this fact into consideration, the intonation patterns have been studied and analysed for Hindi language and compared with some other languages. On the basis of the analysis on intonation patterns, an algorithm has been proposed for emotion conversion. This algorithm requires storing of neutral utterances only in the database and other expressive style utterances can be derived from these neutral utterances. The proposed algorithm is based on the linear modification model, where fundamental frequency is a major factor to convert emotions. In this paper, surprise, sadness, happiness, and anger are taken as target emotions. For performing experiments and verification of results, an emotion-rich speech database is prepared with the help of 15 native speakers. Each speaker has been given 20 sentences and asked to generate utterances in five expressive states “neutral”, “sadness”, “anger”, “surprise”, and “happiness”. These sentences are recorded in normal laboratory conditions with a 44.1 KHz sampling rate and 16-bit precision with mono channel. In the perception test, group of listeners were asked to listen to the utterances from database and judge the emotion. This perception test involves classification of the emotions already available in the database by the listener and to judge the quality of converted neutral utterances. The results are analysed and performance of the experiment is evaluated. The accuracy of the perception test on transformed emotions was found out to be 93.2% for surprise, 91.6% for sadness, 83% for happiness, and 95.3% for anger.
Read full abstract