Excitation Source Features Research Articles

Children with autism spectrum disorder (ASD) produce speech sounds different from that of Normal or non-ASD children. Hence, analyzing acoustic features can help characterizing the ASD speech signals. In this study, the distinguishing characteristics of speech production are examined for ASD affected children, with comparison to Normal children’s speech. Acoustic features are analyzed first and then classification of ASD vs Normal speech is attempted using different machine learning techniques. Two speech sound databases are recorded for this study: the speech database of children with ASD and the speech database of Normal children. English speech utterances are recorded for children of Indian regional (Tamil and Telugu) nativity. The changes due to autism effect are examined in context of 5 English vowel sounds (/a/, /e/, /i/, /o/, and /u/). Changes in the speech production characteristics of children are explored using three sets of features. Firstly, changes in the excitation source features are examined using strength of excitation (SoE) and instantaneous fundamental frequency (F0). Secondly, changes in the vocal tract (VT) filter features are examined using dominant frequencies (FD1, FD2) and formant frequencies (F1 to F5). Thirdly, changes in the source-filter combined features are examined using signal energy (E), zero-crossing rate (ZCR), linear prediction cepstrum coefficients (LPCC), and Mel-frequency cepstral coefficients (MFCC). Then, various combinations of the acoustic features are classified utilizing machine learning methods such as probabilistic neural network (PNN), multilayer perceptron (MLP), support vector machine (SVM), and K-nearest neighbors (KNN). Analyses of acoustic features shows significant differences between the speech of children with ASD and the Normal children. Results up to 98.17% accuracy are obtained for classification between acoustic features of the speech sounds of children with ASD and the Normal children. The observations and this study results may be useful as acoustic biomarkers to identify autism and its progression/cure among children. This study may also be valuable towards developing a system for ASD diagnosis from children’s speech sounds, in the future.

Read full abstract

An objective machine-driven measure of song intelligibility would be of great utility for various music information retrieval tasks. Song intelligibility mostly depends on two factors, the amount of interference caused by background accompaniment, and the quality of singing vocal. We leverage these two factors to determine the intelligibility of a song. For the first factor, we adapt a well known method for intelligibility prediction of noisy speech, short term objective intelligibility (STOI), to singing. The singing-adapted STOI considers the polyphonic song as a time-frequency weighted noisy version of the extracted singing vocal. We use U-net based audio source separation method to extract singing vocal from a polyphonic song. The singing vocal shares the same underlying physiological mechanism for production as that of speech, with some differences in the pronunciation and prosody of the phonemes. Therefore, for the second factor, we have introduced vocal-specific features to measure the intelligibility of the singing vocal, which are excitation source, spectral, and prosodic singing characteristics. We perform detailed analysis on each of these features to establish their efficacy for quantifying song intelligibility. We train a regression model to derive the intelligibility scores using a combination of the vocal-specific features and singing adapted STOI, obtaining a significant improvement in performance. The correlation between the intelligibility score obtained using proposed framework and human-rated intelligibility score is 0.81, which shows the efficacy of the proposed approach.

Read full abstract

Excitation Source Features Research Articles

Related Topics

Articles published on Excitation Source Features

Automatic diagnosis of COVID-19 related respiratory diseases from speech.

PZT and PVDF piezoelectric transducers’ design implications on their efficiency and energy harvesting potential

Noise spectrum characteristics of marine pump units induced by different excitation sources

Analysis and classification of speech sounds of children with autism spectrum disorder using acoustic features

Analysis of excitation source characteristics and their contribution in a 2-cylinder diesel engine

Detection of replay signals using excitation source and shifted CQCC features

Robust vowel region detection method for multimode speech

Estimation of age from speech using excitation source features

DNN-HMM-Based Speaker-Adaptive Emotion Recognition Using MFCC and Epoch-Based Features

Stability analysis of floating raft system under multiexcitation condition

Multilingual and multimode phone recognition system for Indian languages

Exploration of excitation source information for shouted and normal speech classification.

Automatic Evaluation of Song Intelligibility Using Singing Adapted STOI and Vocal-Specific Features

Analysis of aperiodicity in artistic Noh singing voice using an impulse sequence representation of excitation source.

Quantum Confinement Explains Pump-Dependent Luminescence from Butyl-Terminated Si Quantum Dots

Usefulness of linear prediction residual for replay attack detection

Detection of Nasalized Voiced Stops in Cleft Palate Speech Using Epoch-Synchronous Features

On a space-time regularization for force reconstruction problems

Exploring Text-Constraint Models and Source Information for Long-Enrollment with Short-Test Speaker Verification

I-Vector-Based Speaker Verification on Limited Data Using Fusion Techniques

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Excitation Source Features Research Articles

Related Topics

Articles published on Excitation Source Features

Automatic diagnosis of COVID-19 related respiratory diseases from speech.

PZT and PVDF piezoelectric transducers’ design implications on their efficiency and energy harvesting potential

Noise spectrum characteristics of marine pump units induced by different excitation sources

Analysis and classification of speech sounds of children with autism spectrum disorder using acoustic features

Analysis of excitation source characteristics and their contribution in a 2-cylinder diesel engine

Detection of replay signals using excitation source and shifted CQCC features

Robust vowel region detection method for multimode speech

Estimation of age from speech using excitation source features

DNN-HMM-Based Speaker-Adaptive Emotion Recognition Using MFCC and Epoch-Based Features

Stability analysis of floating raft system under multiexcitation condition

Multilingual and multimode phone recognition system for Indian languages

Exploration of excitation source information for shouted and normal speech classification.

Automatic Evaluation of Song Intelligibility Using Singing Adapted STOI and Vocal-Specific Features

Analysis of aperiodicity in artistic Noh singing voice using an impulse sequence representation of excitation source.

Quantum Confinement Explains Pump-Dependent Luminescence from Butyl-Terminated Si Quantum Dots

Usefulness of linear prediction residual for replay attack detection

Detection of Nasalized Voiced Stops in Cleft Palate Speech Using Epoch-Synchronous Features

On a space-time regularization for force reconstruction problems

Exploring Text-Constraint Models and Source Information for Long-Enrollment with Short-Test Speaker Verification

I-Vector-Based Speaker Verification on Limited Data Using Fusion Techniques