Glottal Features Research Articles

The properties of acoustic speech have previously been investigated as possible cues for depression in adults. However, these studies were restricted to small populations of patients and the speech recordings were made during patients' clinical interviews or fixed-text reading sessions. Symptoms of depression often first appear during adolescence at a time when the voice is changing, in both males and females, suggesting that specific studies of these phenomena in adolescent populations are warranted. This study investigated acoustic correlates of depression in a large sample of 139 adolescents (68 clinically depressed and 71 controls). Speech recordings were made during naturalistic interactions between adolescents and their parents. Prosodic, cepstral, spectral, and glottal features, as well as features derived from the Teager energy operator (TEO), were tested within a binary classification framework. Strong gender differences in classification accuracy were observed. The TEO-based features clearly outperformed all other features and feature combinations, providing classification accuracy ranging between 81%-87% for males and 72%-79% for females. Close, but slightly less accurate, results were obtained by combining glottal features with prosodic and spectral features (67%-69% for males and 70%-75% for females). These findings indicate the importance of nonlinear mechanisms associated with the glottal flow formation as cues for clinical depression.
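The abstract above does not say how its TEO-based features were computed, so the following is only a minimal sketch of the standard discrete Teager energy operator, psi[n] = x[n]^2 - x[n-1]*x[n+1], applied to a synthetic sinusoid. The function name `teager_energy` and the test signal parameters are assumptions made for illustration and are not taken from the study.

```python
import math


def teager_energy(x):
    """Discrete Teager energy operator: psi[n] = x[n]^2 - x[n-1] * x[n+1].

    Returns one value per interior sample, so the output is two samples
    shorter than the input.
    """
    return [x[n] ** 2 - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]


# Hypothetical test signal (not from the study): a pure sinusoid.
amplitude, omega = 2.0, 0.3  # omega is in radians per sample
signal = [amplitude * math.cos(omega * n) for n in range(50)]

psi = teager_energy(signal)

# For A*cos(omega*n) the operator is constant and equals A^2 * sin(omega)^2,
# which is why it is often read as tracking amplitude and frequency jointly.
expected = (amplitude * math.sin(omega)) ** 2
```

For a pure tone the output is constant, so deviations from a constant in real speech reflect modulation in amplitude or frequency. Any feature set built on top of this operator (band splitting, framing, statistics) would be a further design choice that the abstract does not specify.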

An automatic technique for estimating and modeling the glottal flow derivative source waveform from speech, and applying the model parameters to speaker identification, is presented. The estimate of the glottal flow derivative is decomposed into coarse structure, representing the general flow shape, and fine structure, comprising aspiration and other perturbations in the flow, from which model parameters are obtained. The glottal flow derivative is estimated using an inverse filter determined within a time interval of vocal-fold closure that is identified through differences in formant frequency modulation during the open and closed phases of the glottal cycle. This formant motion is predicted by Ananthapadmanabha and Fant (1982) to be a result of time-varying and nonlinear source/vocal tract coupling within a glottal cycle. The glottal flow derivative estimate is modeled using the Liljencrants-Fant (1986) model to capture its coarse structure, while the fine structure of the flow derivative is represented through energy and perturbation measures. The model parameters are used in a Gaussian mixture model speaker identification (SID) system. Both coarse- and fine-structure glottal features are shown to contain significant speaker-dependent information. For a large TIMIT database subset, averaging over male and female SID scores, the coarse-structure parameters achieve about 60% accuracy, the fine-structure parameters give about 40% accuracy, and their combination yields about 70% correct identification. Finally, in preliminary experiments on the counterpart telephone-degraded NTIMIT database, about a 5% error reduction in SID scores is obtained when source features are combined with traditional mel-cepstral measures.
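The decision step of a Gaussian mixture model SID system, as mentioned in the abstract above, can be sketched roughly as follows: each enrolled speaker has a GMM, and a test utterance is assigned to the speaker whose model gives the highest total log-likelihood over the utterance's feature vectors. Everything concrete here is an assumption for illustration: the diagonal covariances, the hand-written model parameters, the two-dimensional "features", and the names `gmm_log_likelihood` and `identify`. The paper's actual features are glottal flow model parameters, and its models would be trained from data rather than written by hand.

```python
import math


def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at vector x."""
    return sum(
        -0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def gmm_log_likelihood(frames, gmm):
    """Total log-likelihood of feature frames under one GMM.

    gmm is a list of (weight, mean, variance) tuples, one per component.
    """
    total = 0.0
    for x in frames:
        comps = [math.log(w) + log_gauss_diag(x, m, v) for w, m, v in gmm]
        peak = max(comps)  # log-sum-exp for numerical stability
        total += peak + math.log(sum(math.exp(c - peak) for c in comps))
    return total


def identify(frames, speaker_models):
    """Return the speaker whose GMM scores the frames highest."""
    return max(
        speaker_models,
        key=lambda spk: gmm_log_likelihood(frames, speaker_models[spk]),
    )


# Hypothetical, hand-written speaker models (not trained, not from the paper).
speaker_models = {
    "spk_a": [(0.5, [0.0, 0.0], [1.0, 1.0]), (0.5, [1.0, 1.0], [1.0, 1.0])],
    "spk_b": [(0.5, [5.0, 5.0], [1.0, 1.0]), (0.5, [6.0, 6.0], [1.0, 1.0])],
}

# Hypothetical test frames lying near spk_a's components.
test_frames = [[0.2, -0.1], [0.9, 1.1], [0.4, 0.6]]
best = identify(test_frames, speaker_models)
```

Summing per-frame log-likelihoods treats frames as independent, which is the usual simplification in GMM-based identification. Combining coarse- and fine-structure parameters, as the abstract describes, would correspond to concatenating them into one feature vector per frame, though the abstract does not state how the combination was done.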

Articles published on Glottal Features

Particle swarm optimization based feature enhancement and feature selection for improved emotion recognition in speech and glottal signals.

A comparative study of glottal source estimation techniques

Detection of Clinical Depression in Adolescents’ Speech During Family Interactions

The Distribution of Laryngeal Segments in Capanahua

Spoken emotion recognition through optimum-path forest classification using glottal features

Critical Analysis of the Impact of Glottal Features in the Classification of Clinical Depression in Speech

The Lexical and Post-Lexical Phonology of Thai Tones*

Modeling of the glottal flow derivative waveform with application to speaker identification

Three Glottal Features
