Vocal Tract Shape Research Articles

This study aims to investigate acoustic change over time as biomarkers to differentiate among spastic-flaccid dysarthria associated with amyotrophic lateral sclerosis (ALS), spastic dysarthria associated with primary lateral sclerosis (PLS), flaccid dysarthria associated with spinal and bulbar muscular atrophy (SBMA), and to explore how these acoustic parameters are affected by dysarthria severity. Thirty-three ALS patients with mixed flaccid-spastic dysarthria, 17 PLS patients with pure spastic dysarthria, 18 SBMA patients with pure flaccid dysarthria, and 70 controls, all French speakers, were included in the study. Speakers produced vowel-glide sequences targeting different vocal tract shape changes. The mean and coefficient of variation of the total squared change of mel frequency cepstral coefficients were used to capture the degree and variability of acoustic changes linked to vocal tract modifications over time. Differences in duration of acoustic events were also measured. All pathological groups showed significantly less acoustic change compared to controls, reflecting less acoustic contrast in sequences. Spastic and mixed spastic-flaccid dysarthric speakers showed smaller acoustic changes and slower sequence production compared to flaccid dysarthria. For dysarthria subtypes associated with a spastic component, reduced degree of acoustic change was also associated with dysarthria severity. The acoustic parameters partially differentiated among the dysarthria subtypes in relation to motor neuron diseases. While similar acoustic patterns were found in spastic-flaccid and spastic dysarthria, crucial differences were found between these two subtypes relating to variability. The acoustic patterns were much more variable in ALS. This method forms a promising clinical tool as a diagnostic marker of articulatory impairment, even at mild stage of dysarthria progression in all subtypes.

In this paper, we propose a fundamental frequency ( $$f_0$$ ) estimation method for monophonic songs. The proposed method (SongF0) performs simultaneous voiced/unvoiced detection (VUD) and $$f_0$$ estimation from the frequency spectrum. Even though the spectrogram exhibits ample information of the singer $$f_0$$ in the form of harmonic partials, the existing spectrum-based $$f_0$$ detection methods fail to accurately extract the $$f_0$$ due to the complex singing styles. In order to cover a wider range of fundamental frequencies, singers adjust their vocal tract shape by lowering the larynx and widening the pharynx results in tuning lower harmonic partial to formant frequencies (Sundberg in STL-QPSR 1:1–6, 1968; Speech Transm Lab Q Prog Status Rep 4:21–39, 1970). Hence, most of the available popular $$f_0$$ detection methods which are inspired by speech production mechanism are susceptible to the energized higher-order harmonic spectral partials near the formants. In this work, we profoundly explore the quasi-harmonic nature of the spectral peaks to minimize the effect of formant frequencies on the detected $$f_0$$ . Initially, we train an ensemble classifier with the novel spectral features to predict the candidate $$f_0$$ harmonic partials from the spectrum. We exploit the property of constant spectral distance between the harmonic partials to reliably extract the singing $$f_0$$ from the predicted harmonic partials. We propose novel post-processing methods to significantly improve the $$f_0$$ detection accuracy in the weakly voiced and inharmonic transition regions. The proposed SongF0 which is independent of the vocal $$f_0$$ range is compared with the state-of-the-art $$f_0$$ extraction methods proposed for both speech and singing voice. The evaluation results on the openly available singing $$f_0$$ gold standard datasets revealed that the proposed method is significantly better than the several state-of-the-art $$f_0$$ detection methods.

Vocal Tract Shape Research Articles

Related Topics

Articles published on Vocal Tract Shape

Automatic generation of the complete vocal tract shape from the sequence of phonemes to be articulated

Acoustic Change Over Time in Spastic and/or Flaccid Dysarthria in Motor Neuron Diseases

Voice source, formant frequencies and vocal tract shape in overtone singing. A case study

Discrete constriction locations describe a comprehensive range of vocal tract shapes in the Maeda model.

Effect of cumulative distance in motor space on segment-sized slips in 3-word phrases

Measurement of Pharyngeal Air Pressure During Phonation Using High-Resolution Manometry.

Dark tone quality and vocal tract shaping in soprano song production: Insights from real-time MRI.

Complexity of vocal tract shaping in glossectomy patients and typical speakers: A principal component analysis.

Using browser-based software to teach acoustic theory of speech production

Acoustically Induced Vocal Training for Individuals With Impaired Hearing

An inverse problem to determine the shape of a human vocal tract

Objective Measures of Two Musical Interpretations of an Excerpt From Berlioz's “La mort d'Ophélie”

$F_0$-Noise-Robust Glottal Source and Vocal Tract Analysis Based on ARX-LF Model

Variation of Formant Frequency with Nepalese Vowels

Using visual feedback to tune the second vocal tract resonance for singing in the high soprano range

Acoustic and articulatory analysis and synthesis of shouted vowels

Articulation and identification of voiced stop consonants produced by acoustically driven vocal tract modulations

Printable 3D vocal tract shapes from MRI data and their acoustic and aerodynamic properties

SongF0: A Spectrum-Based Fundamental Frequency Estimation for Monophonic Songs

How an aglossic speaker produces an alveolar-like percept without a functional tongue tip.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Vocal Tract Shape Research Articles

Related Topics

Articles published on Vocal Tract Shape

Automatic generation of the complete vocal tract shape from the sequence of phonemes to be articulated

Acoustic Change Over Time in Spastic and/or Flaccid Dysarthria in Motor Neuron Diseases

Voice source, formant frequencies and vocal tract shape in overtone singing. A case study

Discrete constriction locations describe a comprehensive range of vocal tract shapes in the Maeda model.

Effect of cumulative distance in motor space on segment-sized slips in 3-word phrases

Measurement of Pharyngeal Air Pressure During Phonation Using High-Resolution Manometry.

Dark tone quality and vocal tract shaping in soprano song production: Insights from real-time MRI.

Complexity of vocal tract shaping in glossectomy patients and typical speakers: A principal component analysis.

Using browser-based software to teach acoustic theory of speech production

Acoustically Induced Vocal Training for Individuals With Impaired Hearing

An inverse problem to determine the shape of a human vocal tract

Objective Measures of Two Musical Interpretations of an Excerpt From Berlioz's “La mort d'Ophélie”

$F_0$-Noise-Robust Glottal Source and Vocal Tract Analysis Based on ARX-LF Model

Variation of Formant Frequency with Nepalese Vowels

Using visual feedback to tune the second vocal tract resonance for singing in the high soprano range

Acoustic and articulatory analysis and synthesis of shouted vowels

Articulation and identification of voiced stop consonants produced by acoustically driven vocal tract modulations

Printable 3D vocal tract shapes from MRI data and their acoustic and aerodynamic properties

SongF0: A Spectrum-Based Fundamental Frequency Estimation for Monophonic Songs

How an aglossic speaker produces an alveolar-like percept without a functional tongue tip.