Phoneme Level Research Articles

In order to expand the research area of Korean language pedagogy, this paper attempted to explore convergence research tasks with academic fields that have the commonality of Korean, such as Speech-Language Pathology and auditory studies. Speech-Language pathology is essentially an academic field that seeks ways to solve problems for people who have communication problems. Audiology is also an academic field that focuses on how to overcome communication difficulties using speech sounds based on the perception characteristics of speech sounds. As such, the need for convergence research to solve tasks that are difficult to solve independently in each academic field is great in that they are interested in the nature of Korean and its usage patterns. In the field of Speech-Language Pathology, in order to solve problems related to language development, standards for accurately measuring the degree of language development and treating people with delayed development or disabilities are needed. Although there is a need for convergence research with the Korean Language Pedagogy on how to prepare standards and guide language functions such as reading, writing, speaking, and listening, as well as essential vocabulary and grammar dlements used in everyday life, according to age and school age, such research has not been sufficiently conducted. There is no standard for treating this because it does not clearly present clear standards for language ability development, and it is difficult to present appropriate and effective educational methods in terms of educational methodology. In this regard, it was argued that future research on the development and rehabilitation methods of language development diagnostic tools should be conducted. In the field of Audiology, research related to the development of speech perception test tools is needed. For full communication, it is necessary to develop a tool that can test the communication ability of words, phrases, sentences, and discourse units beyond the stimulation of speech sounds at the phoneme level. Although there are differences in educational methods depending on the subject of education, linguistic pathology, and audiology, the fundamental content of education is very large. In the case of researchers, there may be a tendency to think that Korean language pedagogy has conducted relatively more research related to the characteristics of the Korean language than speech-language pathology and audiology. However, it is necessary to acknowledge that the research results on the nature of the Korean language required by neighboring disciplines are insignificant, and to conduct research to identify the nature and usage patterns of the Korean language through communication and cooperation with neighboring disciplines.

Read full abstract

In the last 10 decades various methods have been introduced to detect prolonged speech segments automatically for stuttered speech signals. However less attention has been paid by researches in the detection of prolongation disorder at the parametric level. The aim of this study is to propose a hybrid approach to detect the prolonged speech segments by combining various spectral parameters with their recognition accuracies for the reconstructed speech signal. The paper presents prolonged segments detection by considering the parameters individually, combining various spectral parameters, validation of prolongation detection system, MFCC feature extraction process, basic model accuracies for the reconstructed signals. The proposed methods are simulated and experimented on UCLASS derived dataset. Obtained results are compared with the existing works of prolongation detection at parametric and word level. It is observed that hybrid parameters yield 92% of recognition rate for larger frame sizes of 200ms when modeled with SVM. The results are also tabulated and discussed for various metrics like sensitivity, specificity and accuracy metrics in detecting the prolonged segments. The study also focuses on the prolongation characteristics of vocalized and non-vocalized sounds at phoneme level. The detection accuracy of 71% is observed for Vocalized prolonged vowel phonemes over non-vocalized prolonged signal. Objectives: The objective of this work is to propose a hybrid algorithm to detect prolonged segments automatically for speech signal with prolongation disorder. The other objective is to evaluate the obtained spectral parameters performances by applying to various evaluation metrics and models to compute the recognition accuracy of a reconstructed signal. The objective is further extended to bring out the importance of variable frame size concept and to analyze the variations in vocalized and non-vocalized sounds. Methods: The methods adopted to detect prolonged speech segments are discussed at two levels namely at the preprocessing and modeling levels. The Preprocessing level is discussed by applying various parameters at an individual level, hybrid level by combing the Centroid, Entropy, Energy, ZCR parameters and MFCC feature extraction method. A new method has been applied using Specificity, Sensitivity and accuracy metrics to validate the prolongation detection model performance. In modeling level, the above parameters are discussed by applying evaluation metrics for the clustering and classification models like K-means, FCM and SVM. The performance of these methods is considered for evaluating and estimating the prolonged segment detection accuracy of the reconstructed speech signals of vocalized and non-vocalized sounds. All these methods are discussed in detail in the following sections. Findings: Hybridizing the spectral parameters to detect the prolonged speech segment automatically is a major finding of this work. It is also found that Specificity, sensitivity and accuracy metrics plays a major role in designing and validating the prolongation detection model. From the further experiments it is identified that the hybrid and verification metrics suits better for vocalized and non-vocalized sounds when larger frame lengths are considered. SVM has been found to perform better for all the above considerations. Novelty: As per Literature survey it is observed that individual and few parameters are applied to detect the prolongation. But works are not addressed on applying or combining more than two parameters to detect the prolonged speech segments. The novelty of this work lies in selecting and combining the spectral parameters at the preprocessing stage to detect the prolongation disorder. Spectral centroid and entropy are considered as appropriate parameters along with ZCR and Energy parameters. Hence hybridizing these parameters results in a novelty to propose an automatic prolongation detection system. Novelty is further brought by applying Specificity, sensitivity and accuracy metrics to build and evaluate the detection system for vocalized and non-vocalized prolonged sounds.

Read full abstract

Phoneme Level Research Articles

Related Topics

Articles published on Phoneme Level

Train & Constrain: Phonologically Informed Tongue Twister Generation from Topics and Paraphrases

Effects of Lexical Properties in L2 Chinese Compound Processing: A Multivariate Approach.

Synthetic faces generated with the facial action coding system or deep neural networks improve speech-in-noise perception, but not as much as real faces.

한국어교육학과 언어병리학 그리고 청각학의 학문 간 융합 연구 시론

Comparative study of event-related potential responses within syllables of intra and inter phoneme classes

Shared Loanword Recognition in German-English Bilinguals: The Role of Metrical Phonology.

W2VC: WavLM representation based one-shot voice conversion with gradient reversal distillation and CTC supervision

Phonemic structure of the root morpheme in the Gothic language (a complex quantitative analysis of lexicographic sources)

Automatic Detection System for Velopharyngeal Insufficiency Based on Acoustic Signals from Nasal and Oral Channels.

Outline of the problem of developing the phonetic level of speech in older preschool children with logopathology

Towards a Vocal and Acoustic Description of Kapa Haka

Perceiving speech during orthographic syllable recognition: Beyond phonemic identity

Hybrid Approach to Detect Prolonged Speech Segments

Phonological Awareness and Word Reading Fluency Among Young Saudi Learners of English

Securing Liveness Detection for Voice Authentication via Pop Noises

Extracting Spatial Muscle Activation Patterns in Facial and Neck Muscles for Silent Speech Recognition Using High-Density sEMG

A Stylistic Study in Surat Al-Baynah

Development of Small Vocabulary Continuous Speech-to-Text System for Kannada Language/Dialects

Decoding silent speech from high-density surface electromyographic data using transformer

English Speech Scoring System Based on Computer Neural Network

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Phoneme Level Research Articles

Related Topics

Articles published on Phoneme Level

Train &amp; Constrain: Phonologically Informed Tongue Twister Generation from Topics and Paraphrases

Effects of Lexical Properties in L2 Chinese Compound Processing: A Multivariate Approach.

Synthetic faces generated with the facial action coding system or deep neural networks improve speech-in-noise perception, but not as much as real faces.

한국어교육학과 언어병리학 그리고 청각학의 학문 간 융합 연구 시론

Comparative study of event-related potential responses within syllables of intra and inter phoneme classes

Shared Loanword Recognition in German-English Bilinguals: The Role of Metrical Phonology.

W2VC: WavLM representation based one-shot voice conversion with gradient reversal distillation and CTC supervision

Phonemic structure of the root morpheme in the Gothic language (a complex quantitative analysis of lexicographic sources)

Automatic Detection System for Velopharyngeal Insufficiency Based on Acoustic Signals from Nasal and Oral Channels.

Outline of the problem of developing the phonetic level of speech in older preschool children with logopathology

Towards a Vocal and Acoustic Description of Kapa Haka

Perceiving speech during orthographic syllable recognition: Beyond phonemic identity

Hybrid Approach to Detect Prolonged Speech Segments

Phonological Awareness and Word Reading Fluency Among Young Saudi Learners of English

Securing Liveness Detection for Voice Authentication via Pop Noises

Extracting Spatial Muscle Activation Patterns in Facial and Neck Muscles for Silent Speech Recognition Using High-Density sEMG

A Stylistic Study in Surat Al-Baynah

Development of Small Vocabulary Continuous Speech-to-Text System for Kannada Language/Dialects

Decoding silent speech from high-density surface electromyographic data using transformer

English Speech Scoring System Based on Computer Neural Network

Train & Constrain: Phonologically Informed Tongue Twister Generation from Topics and Paraphrases