Place Of Articulation Research Articles

Research work on the design of robust multimodal speech recognition systems making use of acoustic and visual cues, extracted using the relatively noise robust alternate speech sensors is gaining interest in recent times among the speech processing research fraternity. The primary objective of this work is to study the exclusive influence of Lombard effect on the automatic recognition of the confusable syllabic consonant-vowel units of Hindi language, as a step towards building robust multimodal ASR systems in adverse environments in the context of Indian languages which are syllabic in nature. The dataset for this work comprises the confusable 145 consonant-vowel (CV) syllabic units of Hindi language recorded simultaneously using three modalities that capture the acoustic and visual speech cues, namely normal acoustic microphone (NM), throat microphone (TM) and a camera that captures the associated lip movements. The Lombard effect is induced by feeding crowd noise into the speaker’s headphone while recording. Convolutional Neural Network (CNN) models are built to categorise the CV units based on their place of articulation (POA), manner of articulation (MOA), and vowels (under clean and Lombard conditions). For validation purpose, corresponding Hidden Markov Models (HMM) are also built and tested. Unimodal Automatic Speech Recognition (ASR) systems built using each of the three speech cues from Lombard speech show a loss in recognition of MOA and vowels while POA gets a boost in all the systems due to Lombard effect. Combining the three complimentary speech cues to build bimodal and trimodal ASR systems shows that the recognition loss due to Lombard effect for MOA and vowels reduces compared to the unimodal systems, while the POA recognition is still better due to Lombard effect. A bimodal system is proposed using only alternate acoustic and visual cues which gives a better discrimination of the place and manner of articulation than even standard ASR system. Among the multimodal ASR systems studied, the proposed trimodal system based on Lombard speech gives the best recognition accuracy of 98%, 95%, and 76% for the vowels, MOA and POA, respectively, with an average improvement of 36% over the unimodal ASR systems and 9% improvement over the bimodal ASR systems.

Read full abstract

Objectives: In this study, examined the acoustic properties of affricates /t/ and /th/ in Mandarin Chinese, and analyzed the differences of the acoustic characteristics of these affricates produced by children with repaired cleft palate and normally developing children. We also explored the relationship between the affricates and high-front vowel /i/. Methods: We analyzed 16 monosyllabic words with alveolo-palatal affricates as the initial consonants produced by children with repaired cleft palate (N=13, Mean=5.9 years) and normally developing children (N=6, Mean age=5.3 years). We used several acoustic parameters to investigate the characteristics of these affricates, such as the center of gravity, VOT and the formants of vowels. Results: Compared with normally developing children, children with cleft palate exhibited a lower center of gravity for the 2 affricates /t/ and /th/. Data from the control group showed that the affricate /th/ had a significantly greater center of gravity than that of /t/. The accuracy of /t , th/ produced by speakers of cleft palate was significantly correlated with that of /i/ (r=0.63). High-front vowel /i/ is a significant index in diagnosing speech intelligibility which is more valuable than /a/ and /u/. There was a significant difference in F2 of vowel /i/ between children with cleft palate without speech therapy (CS1) and after speech therapy (CS2). After speech intervention, the accuracy of affricates produced by children with cleft palate was improved, the acoustic properties "stop + noise segments" appeared. Conclusion: Children with cleft palate can be distinguished better from children with normal development by 2 significant acoustic characteristics: center of gravity and VOT. As alveolo-palatal affricates /t , th/ and high-front vowel /i/ have a similar place of articulation, front-tongue-blade, their production accuracy can be improved mutually. The analysis showed that the articulation of Chinese /i/ has a higher frontal lingual position and less variability, which is more conducive to articulation training and improves the effect of cleft palate training. These findings provide a potential relationship on affricates /t, th/ and vowel /i/. Children with cleft palate have difficulty pronouncing the /t, t h/ and /i/. It is better to start with a vowel /i/, resulting in improvement in overall speech intelligibility.

Read full abstract

Place Of Articulation Research Articles

Related Topics

Articles published on Place Of Articulation

A Study on the Impact of Lombard Effect on Recognition of Hindi Syllabic Units Using CNN Based Multimodal ASR Systems

Phonetic variation in Italian L2: An acoustic analysis of sibilant fricatives in the speech of L1 Spanish learners

Theory Of English Constant In Phonology

Aging and sex effects on phoneme perception: An exploratory mismatch negativity and P300 investigation

Phonetic variation in Italian L2: An acoustic analysis of sibilant fricatives in the speech of L1 Spanish learners

The association between longitudinal declines in speech sound accuracy and speech intelligibility in speakers with amyotrophic lateral sclerosis

Acoustic and Perceptual Categorization of Sibilants for Mandarin Children With Ankyloglossia.

ASYMMETRY IN THE SIMPLIFICATION OF REVERSED SONORITY CLUSTERS IN (A)TYPICAL PHONOLOGICAL DEVELOPMENT: EVIDENCE FROM GREEK

The evolution of similarity avoidance: a phylogenetic approach to phonotactic change

Knowledge-Based Features for Speech Analysis and Classification: Pronunciation Diagnoses

Voice Onset Time in a language without voicing contrast: An acoustic analysis of Blackfoot oral stops

Quantitative Acoustic versus Deep Learning Metrics of Lenition

The Euclidean metrics applied to the interphonemic distance measurements

The McGurk Illusion: A Default Mechanism of the Auditory System

Sonority sequencing and its relationship to articulatory timing in Georgian

The Historical Changes of /k/ and /q/ in Najdi Arabic: A Phonological Analysis

모음 스펙트럼에 기반한 전후 비자음 조음위치 판별*

Neural networks’ posterior probability as measure of effects of alcohol on speech

A Corpus-based Study on Speech Errors in Pronouncing the Fricative // by Chinese Learners of English

Characteristics of Alveolo-palatal Affricates Produced by Mandarin-speaking Children with Repaired Cleft Palate.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Place Of Articulation Research Articles

Related Topics

Articles published on Place Of Articulation

A Study on the Impact of Lombard Effect on Recognition of Hindi Syllabic Units Using CNN Based Multimodal ASR Systems

Phonetic variation in Italian L2: An acoustic analysis of sibilant fricatives in the speech of L1 Spanish learners

Theory Of English Constant In Phonology

Aging and sex effects on phoneme perception: An exploratory mismatch negativity and P300 investigation

Phonetic variation in Italian L2: An acoustic analysis of sibilant fricatives in the speech of L1 Spanish learners

The association between longitudinal declines in speech sound accuracy and speech intelligibility in speakers with amyotrophic lateral sclerosis

Acoustic and Perceptual Categorization of Sibilants for Mandarin Children With Ankyloglossia.

ASYMMETRY IN THE SIMPLIFICATION OF REVERSED SONORITY CLUSTERS IN (A)TYPICAL PHONOLOGICAL DEVELOPMENT: EVIDENCE FROM GREEK

The evolution of similarity avoidance: a phylogenetic approach to phonotactic change

Knowledge-Based Features for Speech Analysis and Classification: Pronunciation Diagnoses

Voice Onset Time in a language without voicing contrast: An acoustic analysis of Blackfoot oral stops

Quantitative Acoustic versus Deep Learning Metrics of Lenition

The Euclidean metrics applied to the interphonemic distance measurements

The McGurk Illusion: A Default Mechanism of the Auditory System

Sonority sequencing and its relationship to articulatory timing in Georgian

The Historical Changes of /k/ and /q/ in Najdi Arabic: A Phonological Analysis

모음 스펙트럼에 기반한 전후 비자음 조음위치 판별*

Neural networks’ posterior probability as measure of effects of alcohol on speech

A Corpus-based Study on Speech Errors in Pronouncing the Fricative // by Chinese Learners of English

Characteristics of Alveolo-palatal Affricates Produced by Mandarin-speaking Children with Repaired Cleft Palate.