Confusion analysis in phoneme based speech recognition in Hindi

Shobha Bhatt,Anurag Jain,Amita Dev

doi:10.1007/s12652-020-01703-x

Abstract

Phoneme recognition is an essential step in the development of a speech recognition system (SRS), as phonemes are fundamental building blocks in a spoken language. This research work aimed to present phoneme recognition with systematic confusion analysis for the Hindi language. The accuracy of phoneme recognition is the foundation for developing an efficient SRS. Therefore, the systematic confusion analysis for phoneme recognition is essential to improve speech recognition performance. Experiments conducted on Continuous Hindi speech corpus for phoneme recognition with speaker-dependent mode using Hidden Markov Model (HMM) based tool kit HTK. Feature extraction technique Perceptual Linear Predictive Coefficient (PLP) was used with five states Monophones HMM model. Tests were performed for exploring the recognition of Hindi vowels and consonants. Confusion matrices were presented for both vowels and consonants with analysis and possible solutions. During systematic analysis, the vowels were divided into front, middle, and back vowels while consonants were categorized based on place of articulation and manner of articulation. Research findings show that some Hindi phonemes have significant effects on speech recognition. The investigations also reveal that some Hindi phonemes are mostly confused, and some phonemes have more deletions and insertions. The research further demonstrates that the words made of less number of phonemes show more insertion errors. It was also found that most of the Hindi sentences end with some specific words. These particular words can be used to reduce the search place in language modeling for improving speech recognition. The research findings can be utilized to enhance the performance of the speech recognition system by selecting suitable feature extraction techniques and classification techniques for phonemes. The outcome of the research can also be used to develop improved pronunciation dictionaries and designing the text for developing phonetically balanced speech corpus for improvement in speech recognition. Experimental results show an average corrected recognition score of 70% for vowel class and consonant categories, the maximum average corrected recognition score of 94% was obtained with palatal sounds, and the lowest average corrected recognition score of 54% was achieved with liquid sounds. The comparative analysis of the presented work was made to similar existing works.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Confusion analysis in phoneme based speech recognition in Hindi

Abstract

Talk to us

Similar Papers

More From: Journal of Ambient Intelligence and Humanized Computing

Lead the way for us

Journal: Journal of Ambient Intelligence and Humanized Computing	Publication Date: Feb 1, 2020
Citations: 15

Similar Papers

Hindi Phoneme Recognition - A Review
Shobha Bhatt ... Amita Dev
-
Shobha Bhatt, et. al.Shobha Bhatt ... Amita Dev
01 Jan 2021
01 Jan 2021

A Study on the Impact of Lombard Effect on Recognition of Hindi Syllabic Units Using CNN Based Multimodal ASR Systems
...
Archives of Acoustics | VOL. 45
, et. al. ...
26 Jul 2023
Archives of Acoustics | VOL. 45

Effects of spectral smearing on phoneme and word recognition.
Arthur Boothroyd ... Juan Gong
The Journal of the Acoustical Society of America | VOL. 100
Arthur Boothroyd, et. al.Arthur Boothroyd ... Juan Gong
01 Sep 1996
The Journal of the Acoustical Society of America | VOL. 100

Feature Extraction Techniques with Analysis of Confusing Words for Speech Recognition in the Hindi Language
Shobha Bhatt ... Anurag Jain
Wireless Personal Communications | VOL. 118
Shobha Bhatt, et. al.Shobha Bhatt ... Anurag Jain
13 Feb 2021
Wireless Personal Communications | VOL. 118

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Confusion analysis in phoneme based speech recognition in Hindi

Abstract

Talk to us

Similar Papers

More From: Journal of Ambient Intelligence and Humanized Computing