Phonetic Units Research Articles

It is widely accepted that invariant and discrete phonological units at the linguistic level are transformed into variable and continuous movements of speech organs, which in turn results in equally continuous acoustical results. The variability of phonemic units depends on neighbouring phonetic units, but also on the various linguistic, communicational and pragmatic contexts of a particular speech act. The influence of phonetic units upon each other results in adaptations, coarticulations and assimilations. By means of assimilation at least one distinctive feature of a phoneme is changed, so the observed phoneme becomes similar to its neighbouring sound – the assimilation operator. This paper is aimed at analysing the influence of speech rate on assimilation processes in the voiced fricative /z/, when it is preceded by sounds /s, z, ʃ, ʒ / in four different types of articulatory joint: sentence, clausal, lexemic and proclitical. The articulatory joint refers to the production of two phonemes separated by different types of linguistic boundaries. Twenty female native speakers of Croatian with no history of speech or hearing impairments read a text at both natural and fast speech rates. The acoustical recording was performed in a sound-treated room. The Praat software was used to analyse six variables in all occurrences of the sound /z/: duration, spectrum centre of gravity, standard deviation of the centre of gravity, spectral skewness, spectral kurtosis, and harmonic to noise ratio. The results showed that various linguistic boundaries, speech rates and sounds as assimilation operators influence the degree of assimilation of the phoneme /z/, as measured by the acoustic variables.

Read full abstract

Heterophones pose challenges during training of automatic speech recognition (ASR) systems because they involve ambiguity in the pronunciation of an orthographic representation of a word. Heterophones are words that have the same spelling but different pronunciations. This paper addresses the problem of heterophonic languages by developing the concept of a Composite Phoneme (CP) as a basic pronunciation unit for speech recognition. A CP is a set of alternative sequences of phonemes. CP’s are developed specifically in the context of Arabic by defining phonetic units that are consonant centric and absorb phonemically contrastive short vowels and gemination, not represented in the Arabic Modern Orthography (MO). CPs alleviate the need to diacritize MO into Classical Orthography (CO), to represent short vowels and stress, before generating pronunciation in terms of Simple Phonemes (SP). We develop algorithms to generate CP pronunciation from MO, and SP pronunciation from CO to map a word into a single pronunciation. We investigate the performance of CP, SP, UG (Undiacritized Grapheme), and DG (Diacritized Grapheme) ASRs. The experimental results suggest that UG and DG are inferior to SP and CP. For the A-SpeechDB corpus with MO vocabulary of 8000, the WER for bigram and context dependent phone are: 11.78, 12.64, and 13.59 % for CP, SP_M (SP from manual diacritized CO), and SP_A (SP from automated diacritized MO) respectively. For vocabulary of 24,000 MO words, the corresponding WER’s are 13.69, 15.08, and 16.86 %. For uniform statistical model, SP has a lower WER than CP. For context independent phone (CI), CP has lower WER than SP.

Read full abstract

Phonetic Units Research Articles

Related Topics

Articles published on Phonetic Units

An evaluation of sentence selection methods on the different phone-sized units for constructing Indonesian speech corpus

Basic auditory processing deficits in infants at risk for dyslexia during the sensitive period predict future language

Segmentation of speech on phonetic elements for systems of speech information protection

Phonology based Fuzzy Phoneme Recognition

Itakura–Saito Divergence as an Element of the Information Theory of Speech Perception

Graphemic-phonetic diachronic linguistic invariance of the frequency and of the Index of Coincidence as cryptanalytic tools.

А Study on Ethnic Minority Languages in Siberia: Focused on Buryat Language

Development and analysis of multilingual phone recognition systems using Indian languages

Comparison of Phonemic and Graphemic Word to Sub-Word Unit Mappings for Lithuanian Phone-Level Speech Transcription

A forced gaussians based methodology for the differential evaluation of Parkinson's Disease by means of speech processing

A robust unsupervised pattern discovery and clustering of speech signals

Language Recognition using Neural Phone Embeddings and RNNLMs

Weighted fast sequential DTW for multilingual audio Query-by-Example retrieval

A comparison of three spectral features for phone recognition in sub-optimal environments

The influence of the assimilation operator, speech rate and linguistic boundary on the production of /z/ in Croatian

RHETORIC (PROSODY) IN THE LYRICS OF BIDDLE DEHLAVĪ

An analysis of the influence of deep neural network (DNN) topology in bottleneck feature based language recognition.

An Analysis of Automatic Phone Recognition and Identification of a Few Languages from North Eastern India

Heterophonic speech recognition using composite phones.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Phonetic Units Research Articles

Related Topics

Articles published on Phonetic Units

An evaluation of sentence selection methods on the different phone-sized units for constructing Indonesian speech corpus

Basic auditory processing deficits in infants at risk for dyslexia during the sensitive period predict future language

Segmentation of speech on phonetic elements for systems of speech information protection

Phonology based Fuzzy Phoneme Recognition

Itakura–Saito Divergence as an Element of the Information Theory of Speech Perception

Graphemic-phonetic diachronic linguistic invariance of the frequency and of the Index of Coincidence as cryptanalytic tools.

А Study on Ethnic Minority Languages in Siberia: Focused on Buryat Language

Development and analysis of multilingual phone recognition systems using Indian languages

Comparison of Phonemic and Graphemic Word to Sub-Word Unit Mappings for Lithuanian Phone-Level Speech Transcription

A forced gaussians based methodology for the differential evaluation of Parkinson's Disease by means of speech processing

A robust unsupervised pattern discovery and clustering of speech signals

Language Recognition using Neural Phone Embeddings and RNNLMs

Weighted fast sequential DTW for multilingual audio Query-by-Example retrieval

A comparison of three spectral features for phone recognition in sub-optimal environments

The influence of the assimilation operator, speech rate and linguistic boundary on the production of /z/ in Croatian

RHETORIC (PROSODY) IN THE LYRICS OF BIDDLE DEHLAVĪ

An analysis of the influence of deep neural network (DNN) topology in bottleneck feature based language recognition.

An Analysis of Automatic Phone Recognition and Identification of a Few Languages from North Eastern India

Heterophonic speech recognition using composite phones.