Consonant Recognition Research Articles

This paper determined the effects on consonant enhancement of intensifying useful frequency and time regions (target frequency and time ranges) and removing detrimental frequency and time regions (conflicting frequency and time ranges). Thirteen normal-hearing (NH) listeners participated in two experiments. In the first experiment, the target and conflicting frequency and time ranges for each consonant were identified under a quiet, dichotic listening condition by analyzing consonant confusion matrices. The target frequency range was defined as the frequency range that provided the highest performance and was decreased 40% from the peak performance under both high-pass filtering (HPF) and low-pass filtering (LPF) schemes. The conflicting frequency range was defined as the frequency range that yielded the peak errors of the most confused consonants and was 20% less than the peak error under both filtering schemes. The target time range was defined as the consonant segment that provided the highest performance and was decreased 40% from that peak performance when the duration of the consonant was systematically truncated from the onset. The conflicting time ranges were defined to coincide with the target time ranges because, when the two coincide temporally, the conflicting frequency ranges would be the most detrimental factor affecting the target frequency ranges. In the second experiment, consonant recognition was measured binaurally in noise under three signal processing conditions: unprocessed, target ranges intensified by a 6-dB gain (target), and target ranges intensified combined with conflicting ranges removed (target-conflicting). The results showed that consonant recognition improved significantly with the target condition but deteriorated greatly with the target-conflicting condition. The target condition helped transmit voicing and manner cues, while the target-conflicting condition limited the transmission of these cues. Confusion analyses showed that the effect of the signal processing was consonant-specific: the unprocessed condition was the best for /da, pa, ma, sa/; the target condition was the best for /ga, fa, va, za, ʒa/; and the target-conflicting condition was the best for /na, ʃa/. Perception of /ba, ta, ka/ was independent of the signal processing. The results suggest that enhancing the target ranges is an efficient way to improve consonant recognition, while removing the conflicting ranges negatively impacts consonant recognition.
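
The abstract above relies on consonant confusion matrices to find each consonant's performance and its most confused competitor. As a minimal sketch of that kind of analysis (not the authors' procedure), the Python below computes percent correct and the most frequent wrong response per consonant. The function name `analyze_confusions`, the row/column layout, and the counts are all hypothetical and chosen only for illustration.

```python
def analyze_confusions(labels, matrix):
    """Summarize a confusion matrix.

    Assumed layout: matrix[i][j] is the number of times the consonant
    labels[i] was presented and labels[j] was reported.
    Returns {consonant: (percent_correct, top_confusion, top_confusion_percent)}.
    """
    results = {}
    for i, label in enumerate(labels):
        row = matrix[i]
        total = sum(row)
        percent_correct = 100.0 * row[i] / total
        # Collect every wrong response with its count, then take the largest.
        errors = [(count, labels[j]) for j, count in enumerate(row) if j != i]
        top_count, top_label = max(errors)
        results[label] = (percent_correct, top_label, 100.0 * top_count / total)
    return results


# Hypothetical counts for three consonants, 20 presentations each.
labels = ["ba", "da", "ga"]
matrix = [
    [16, 3, 1],   # /ba/ presented
    [2, 14, 4],   # /da/ presented
    [1, 6, 13],   # /ga/ presented
]

summary = analyze_confusions(labels, matrix)
for consonant, (correct, confused_with, error) in summary.items():
    print(f"/{consonant}/: {correct:.0f}% correct, "
          f"most confused with /{confused_with}/ ({error:.0f}%)")
```

In this toy data, /ga/ is reported as /da/ in 30% of presentations, which is the kind of peak error the abstract associates with a conflicting range.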

Phoneme recognition is an essential step in the development of a speech recognition system (SRS), as phonemes are the fundamental building blocks of a spoken language. This research presents phoneme recognition with systematic confusion analysis for the Hindi language. The accuracy of phoneme recognition is the foundation for developing an efficient SRS, so systematic confusion analysis of phoneme recognition is essential to improving speech recognition performance. Experiments were conducted on a continuous Hindi speech corpus for phoneme recognition in speaker-dependent mode using the Hidden Markov Model (HMM) based toolkit HTK. Perceptual Linear Predictive (PLP) coefficients were used for feature extraction with five-state monophone HMMs. Tests were performed to explore the recognition of Hindi vowels and consonants. Confusion matrices are presented for both vowels and consonants, with analysis and possible solutions. For the systematic analysis, the vowels were divided into front, middle, and back vowels, while the consonants were categorized by place and manner of articulation. The findings show that some Hindi phonemes have significant effects on speech recognition. The investigations also reveal that some Hindi phonemes are frequently confused and that some phonemes have more deletions and insertions. The research further demonstrates that words made of fewer phonemes show more insertion errors. It was also found that most of the Hindi sentences end with certain specific words; these words can be used to reduce the search space in language modeling and so improve speech recognition. The findings can be used to enhance the performance of a speech recognition system by selecting suitable feature extraction and classification techniques for phonemes. The outcome of the research can also be used to develop improved pronunciation dictionaries and to design text for a phonetically balanced speech corpus. Experimental results show an average corrected recognition score of 70% for the vowel class. Among the consonant categories, the maximum average corrected recognition score of 94% was obtained with palatal sounds, and the lowest average corrected recognition score of 54% was obtained with liquid sounds. The presented work was compared with similar existing works.
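
Since the abstract above reports deletions and insertions from an HTK-based recognizer, it may help to recall how HTK's standard scoring separates them. The sketch below uses the two measures defined in the HTK documentation: percent correct, which ignores insertions, and accuracy, which penalizes them. Whether the abstract's "corrected recognition score" corresponds to either measure is an assumption, and the function name `htk_scores` and the counts are hypothetical.

```python
def htk_scores(n_ref, deletions, substitutions, insertions):
    """Return (percent_correct, accuracy) in the style of HTK scoring.

    n_ref is the number of phoneme labels in the reference transcription.
    percent_correct = 100 * (N - D - S) / N
    accuracy        = 100 * (N - D - S - I) / N
    """
    hits = n_ref - deletions - substitutions
    percent_correct = 100.0 * hits / n_ref
    accuracy = 100.0 * (hits - insertions) / n_ref
    return percent_correct, accuracy


# Hypothetical counts: 200 reference phonemes, 10 deleted,
# 30 substituted, and 20 spurious insertions.
percent_correct, accuracy = htk_scores(200, 10, 30, 20)
print(f"Correct: {percent_correct:.1f}%  Accuracy: {accuracy:.1f}%")
```

With these counts the insertions lower accuracy well below percent correct, which is why a high insertion rate on short words matters for overall recognition performance.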

Articles published on Consonant Recognition

Effect of the Target and Conflicting Frequency and Time Ranges on Consonant Enhancement in Normal-Hearing Listeners.

Effects of the Configuration of Hearing Loss on Consonant Perception between Simulated Bimodal and Electric Acoustic Stimulation Hearing.

Effects of Adaptive Non-linear Frequency Compression in Hearing Aids on Mandarin Speech and Sound-Quality Perception

Changes in Orientation Behavior due to Extended High-Frequency (5 to 10 kHz) Spatial Cues.

Vowel Context Effect on the Perception of Stop Consonants in Malayalam and Its Role in Determining Syllable Frequency.

Improving the Ability to Recognize Consonants Through Smart Box Media for Children aged 4-5 Years in Kindergarten

The Effectiveness of Communication-Oriented Pronunciation Instruction and Students' Perception

Using the electrically-evoked compound action potential (ECAP) interphase gap effect to select electrode stimulation sites in cochlear implant users

Consonant Recognition Using Coarticulatory Cues in Individuals with Normal Hearing and Sensorineural Hearing Loss

Auditory Rehabilitation Post-Cochlear Implant

Association Between Tinnitus Pitch and Consonant Recognition in Noise.

Effects of nonlinear frequency compression on Mandarin speech and sound-quality perception in hearing-aid users

The Use of Flannel Board Media to Improve the Ability to Recognize Vowel and Consonant Letters among Group B Children at TKK Rherhedja 2

Auditory and auditory-visual frequency-band importance functions for consonant recognition.

Assessment of Temporal Fine Structure Processing Among Older Adults With Cochlear Implants.

Confusion analysis in phoneme based speech recognition in Hindi

Effects of temporal distortions on consonant perception with and without undistorted visual speech cues.

Age-sensitive associations of segmental and suprasegmental perception with sentence-level language skills in Mandarin-speaking children with cochlear implants

Enhancement of Consonant Recognition in Bimodal and Normal Hearing Listeners.
