Voice Production Mechanisms Research Articles

Automatic voice pathology detection enables objective assessment of pathologies that affect the voice production mechanism. Detection systems have been developed using the traditional pipeline approach (consisting of the feature extraction part and the detection part) and using the modern deep learning -based end-to-end approach. Due to the lack of vast amounts of training data in the study area of pathological voice, the former approach is still a valid choice. In the existing detection systems based on the traditional pipeline approach, the mel-frequency cepstral coefficient (MFCC) features can be regarded as the defacto standard feature set. In this study, automatic voice pathology detection is investigated by comparing the performance of various MFCC variants derived by considering two factors: the input and the filterbank in the cepstrum computation. For the first factor, three inputs (the voice signal, the glottal source and the vocal tract) are compared. The glottal source and the vocal tract are estimated using the quasi-closed phase glottal inverse filtering method. For the second factor, the mel-frequency and linear-frequency filterbanks are compared. Experiments were conducted separately using six databases consisting of voices produced by speakers suffering from one of four disorders (dysphonia, Parkinson’s disease, laryngitis, or heart failure) and by healthy speakers. Support vector machine (SVM) was used as the classifier. The results show that by combining mel- and linear-frequency cepstral coefficients derived from the glottal source and vocal tract, better overall detection accuracy was obtained compared to the defacto MFCC features derived from the voice signal. Furthermore, this combination provided comparable or better performance than four existing cepstral feature extraction techniques in clean and high signal-to-noise ratio (SNR) conditions.

Read full abstract

Voice Production Mechanisms Research Articles

Related Topics

Articles published on Voice Production Mechanisms

Special Issue on Current Trends and Future Directions in Voice Acoustics Measurement

Prevalence of Voice Problems and Associated Risk Factors Among Tamil-Speaking Imams

A Prospective Observational Study to Determine the Added Clinical Value of Videokymography to Videostroboscopy in Patients with Change in Voice.

Laryngeal and Voice Disorders in Patients with Pulmonary Tuberculosis.

Acoustic to kinematic projection in Parkinson’s disease dysarthria

A Comparison of Cepstral Features in the Detection of Pathological Voices by Varying the Input and Filterbank of the Cepstrum Computation

Spatial Segmentation for Laryngeal High-Speed Videoendoscopy in Connected Speech

Toward Development of a Vocal Fold Contact Pressure Probe: Bench-Top Validation of a Dual-Sensor Probe Using Excised Human Larynx Models.

GlotNet—A Raw Waveform Model for the Glottal Excitation in Statistical Parametric Speech Synthesis

Feature Maps of the Acoustic Spectrum of the Voice

The Influence of Posture and Balance on Voice: A Review

A training-based speech regeneration approach with cascading mapping models

Effects of age on the amplitude, frequency and perceived quality of voice.

Reconstruction of Phonated Speech from Whispers Using Formant-Derived Plausible Pitch Modulation

Using image processing technology and mathematical algorithm in the automatic selection of vocal cord opening and closing images from the larynx endoscopy video

Behandlung von Glottisschlussinsuffizienzen

High-Speed Videoendoscopic Analysis of Relationships between Cepstral-Based Acoustic Measures and Voice Production Mechanisms in Patients Undergoing Phonomicrosurgery

Glottal inverse filtering analysis of human voice production — A review of estimation and parameterization methods of the glottal excitation and their applications

Use of laryngeal high-speed videoendoscopy systems to study voice production mechanisms in human subjects

Mapping emotions into acoustic space: The role of voice production

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Voice Production Mechanisms Research Articles

Related Topics

Articles published on Voice Production Mechanisms

Special Issue on Current Trends and Future Directions in Voice Acoustics Measurement

Prevalence of Voice Problems and Associated Risk Factors Among Tamil-Speaking Imams

A Prospective Observational Study to Determine the Added Clinical Value of Videokymography to Videostroboscopy in Patients with Change in Voice.

Laryngeal and Voice Disorders in Patients with Pulmonary Tuberculosis.

Acoustic to kinematic projection in Parkinson’s disease dysarthria

A Comparison of Cepstral Features in the Detection of Pathological Voices by Varying the Input and Filterbank of the Cepstrum Computation

Spatial Segmentation for Laryngeal High-Speed Videoendoscopy in Connected Speech

Toward Development of a Vocal Fold Contact Pressure Probe: Bench-Top Validation of a Dual-Sensor Probe Using Excised Human Larynx Models.

GlotNet—A Raw Waveform Model for the Glottal Excitation in Statistical Parametric Speech Synthesis

Feature Maps of the Acoustic Spectrum of the Voice

The Influence of Posture and Balance on Voice: A Review

A training-based speech regeneration approach with cascading mapping models

Effects of age on the amplitude, frequency and perceived quality of voice.

Reconstruction of Phonated Speech from Whispers Using Formant-Derived Plausible Pitch Modulation

Using image processing technology and mathematical algorithm in the automatic selection of vocal cord opening and closing images from the larynx endoscopy video

Behandlung von Glottisschlussinsuffizienzen

High-Speed Videoendoscopic Analysis of Relationships between Cepstral-Based Acoustic Measures and Voice Production Mechanisms in Patients Undergoing Phonomicrosurgery

Glottal inverse filtering analysis of human voice production — A review of estimation and parameterization methods of the glottal excitation and their applications

Use of laryngeal high-speed videoendoscopy systems to study voice production mechanisms in human subjects

Mapping emotions into acoustic space: The role of voice production