Pathological Speech Research Articles

This paper surveys state-of-the-art approaches to pathological speech signal analysis, with a special focus on parametrization techniques. It describes 92 speech features: some are already widely used in this field, while others have not yet been tried here and come from other areas of speech signal processing, such as speech recognition and coding. As an original contribution, this work introduces 36 new pathological voice measures based on modulation spectra, inferior colliculus coefficients, bicepstrum, sample and approximate entropy, and empirical mode decomposition. The significance of these features was tested on three pathological voice databases (English, Spanish and Czech) with respect to classification accuracy, sensitivity and specificity. To the best of our knowledge, the introduced approach, based on complex feature extraction and robust testing, outperformed all works previously published in this field. The results for the Massachusetts Eye and Ear Infirmary (MEEI) database (accuracy, sensitivity and specificity equal to 100.0±0.0%) are debatable because of that database's limitation related to the length of its sustained vowels. For the Príncipe de Asturias (PdA) Hospital in Alcalá de Henares of Madrid database, however, we improved classification accuracy (82.1±3.3%) and specificity (83.8±5.1%) when considering a single-classifier approach. Large improvements may also be achievable for the Czech Parkinsonian Speech Database (PARCZ), and these are discussed in this work as well. All the features introduced in this work were identified by the Mann–Whitney U test as significant (p < 0.05) on at least one of the mentioned databases. Among the proposed features, the cepstral peak prominence extracted from the first intrinsic mode function has the greatest discriminative power (p = 6.9443×10⁻³²), which suggests that, among the newly designed features, those quantifying hoarseness or breathiness are especially good candidates for pathological speech identification. The paper also outlines ideas for future work in pathological speech signal analysis that may be valuable from a clinical point of view.
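
Sample entropy is one of the signal measures the abstract names as a basis for the new features. The abstract does not give the authors' formulation or parameters, so the following is only a minimal sketch of the commonly used definition, in plain Python. The function name `sample_entropy`, the embedding dimension `m=2`, and the tolerance of 0.2 times the standard deviation are illustrative assumptions, not values taken from the paper.

```python
import math
import random


def sample_entropy(x, m=2, r_factor=0.2):
    """Sample entropy of a 1-D sequence (textbook-style definition).

    m and r_factor are illustrative defaults, not the paper's settings.
    """
    n = len(x)
    mean = sum(x) / n
    std = math.sqrt(sum((v - mean) ** 2 for v in x) / n)
    r = r_factor * std  # tolerance, scaled by the signal's spread

    def count_matches(length):
        # Count template pairs (i < j) whose Chebyshev distance is <= r.
        # The same n - m templates are used for both lengths.
        count = 0
        num_templates = n - m
        for i in range(num_templates):
            for j in range(i + 1, num_templates):
                if all(abs(x[i + k] - x[j + k]) <= r for k in range(length)):
                    count += 1
        return count

    b = count_matches(m)      # matches of length m
    a = count_matches(m + 1)  # matches of length m + 1
    if a == 0 or b == 0:
        return float("inf")   # undefined for this data; reported as infinity
    return math.log(b / a)    # equals -ln(a / b)


# A perfectly periodic signal is fully predictable, so its entropy is zero.
periodic = [0.0, 1.0] * 20

# Gaussian noise is far less regular and yields a larger value.
random.seed(0)
noise = [random.gauss(0.0, 1.0) for _ in range(200)]

print(sample_entropy(periodic))
print(sample_entropy(noise))
```

The nested loops make this quadratic in the signal length, which is acceptable for short sustained-vowel frames but would need a vectorized implementation for long recordings.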

Pathological speech usually refers to speech distortion resulting from atypicalities in voice and/or in the articulatory mechanisms owing to disease, illness or other physical or biological insult to the production system. Although automatic evaluation of speech intelligibility and quality could help experts with diagnosis and treatment design in these scenarios, the many sources and types of variability often make it a very challenging computational processing problem. In this work we propose novel sentence-level features to capture abnormal variation in the prosodic, voice quality and pronunciation aspects of pathological speech. In addition, we propose a post-classification posterior smoothing scheme which refines the posterior of a test sample based on the posteriors of other test samples. Finally, we perform feature-level fusion and subsystem decision fusion to arrive at a final intelligibility decision. Performance is tested on two pathological speech datasets, the NKI CCRT Speech Corpus (advanced head and neck cancer) and the TORGO database (cerebral palsy or amyotrophic lateral sclerosis), by evaluating classification accuracy with no subject's data shared between the training and test partitions. Results show that the feature sets of the voice quality, prosodic and pronunciation subsystems each offer significant discriminating power for binary intelligibility classification. We observe that the proposed posterior smoothing in the acoustic space can further reduce classification errors. The smoothed posterior score fusion of subsystems shows the best classification performance (73.5% unweighted and 72.8% weighted average recall over the binary classes).
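
The abstract reports both unweighted and weighted average recall. As a rough illustration of how the two differ, the sketch below computes them for a toy binary problem under the usual definitions: the unweighted figure is the plain mean of per-class recalls, and the weighted figure weights each class recall by its share of the samples (which makes it equal to overall accuracy). The function name `average_recalls` and the toy labels are assumptions made for this example; they are not data or code from the paper.

```python
from collections import Counter


def average_recalls(y_true, y_pred):
    """Return (unweighted, weighted) average recall over the classes in y_true."""
    totals = Counter(y_true)  # number of samples per true class
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)  # correct per class

    recalls = {c: hits[c] / totals[c] for c in totals}

    # Unweighted: every class counts equally, regardless of its size.
    unweighted = sum(recalls.values()) / len(recalls)

    # Weighted: each class recall is weighted by its share of the samples.
    n = len(y_true)
    weighted = sum(recalls[c] * totals[c] / n for c in totals)
    return unweighted, weighted


# Toy imbalanced example (hypothetical labels: 0 = intelligible, 1 = not).
y_true = [0, 0, 0, 0, 0, 0, 1, 1]
y_pred = [0, 0, 0, 0, 0, 1, 1, 0]

uar, war = average_recalls(y_true, y_pred)
print(f"unweighted: {uar:.3f}, weighted: {war:.3f}")
# prints "unweighted: 0.667, weighted: 0.750"
```

With imbalanced classes the unweighted figure is usually the more conservative one, because errors on the minority class are not diluted by the majority class.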

Articles published on Pathological Speech

S transform feature for pathological speech

An insight to the automatic categorization of speakers according to sex and its application to the detection of voice pathologies: A comparative study

Speech Databases of Typical Children and Children with SLI

Empirically Estimable Classification Bounds Based on a Nonparametric Divergence Measure.

The Effect of Narrow-Band Transmission on Recognition of Paralinguistic Information From Human Vocalizations

Algorithm for Jitter and Shimmer Measurement in Pathologic Voices

Robust and complex approach of pathological speech signal analysis

The Separation of Multi-Class Pathological Speech Signals Related to Vocal Cords Disorders Using Adaptation Wavelet Transform Based on Lifting Scheme

Functional imaging of physiological and pathological speech production

A Pathological Voices Assessment Using Classification

Modeling Pathological Speech Perception From Data With Similarity Labels.

Automatic system to detect the type of voice pathology

Automatic intelligibility classification of sentence-level pathological speech

Feature divergence of pathological speech

Signal Processing and Analysis of Pathological Speech Using Artificial Intelligence and Learning Systems Methods

Towards A Clinical Tool For Automatic Intelligibility Assessment.

Nonlinearities in block-type reduced-order vocal fold models with asymmetric tissue properties

Pathological speech signal analysis and classification using empirical mode decomposition

An Investigation of Vocal Tract Characteristics for Acoustic Discrimination of Pathological Voices
