Abstract

This research is concerned with the development of speech recognition front-end for segmenting and clustering continuous Bangla speech sentence to some predefined clusters. From the study of different previous research works it was observed that the front-end is an important part of any speech recognition system. In our work, the original speech sentences were recorded and stored as RIFF (.wav) file format. Then a segmentation approach was used to segment the continuous speech into uniquely identifiable and meaningful units. Among the different techniques, the word/sub-word segmentation is simple and produces very good results. This is why this technique was selected for speech segmentation to obtain improved performance. After segmentation, the segmented words were clustered into different clusters according to the number of syllables and the sizes of the segmented words. The test database contained 758 words/sub-words segmented from 120 sentences. Each sentence was recorded from six different speakers and saved as a different wave file. The developed system achieved the segmentation accuracy rate at about 95%. Keywords: Front-end, Phonemic and Word segmentation, Clustering, End Point Detection. DOI: 10.3329/diujst.v5i1.4384 Daffodil International University Journal of Science and Technology Vol.5(1) 2010 pp.67-72

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.