Vowel Samples Research Articles

Sustained vowels are important vocal tasks that have been investigated in discriminating voice disorders using acoustic analysis. To date, no study has combined only vowel acoustic measures that evaluate the major aspects of pathological voice signals for voice disorder discrimination. This study aimed to investigate the value of vowel acoustic measures that quantify glottal noise, signal stability, signal periodicity, spectral slope and overall voice quality in discriminating female speakers with and without voice disorders.

Sustained vowel /ɑ/ samples were extracted from 133 voice-disordered female patients and 97 non-voice-disordered female speakers and were signal typed prior to analysis. Praat software was used to measure harmonics-to-noise ratio (HNR), glottal-to-noise excitation ratio (GNE), the standard deviation of fundamental frequency (F0SD) and cepstral peak prominence (CPPp). The Analysis of Dysphonia in Speech and Voice (ADSV) program was used to measure CPPadsv, low/high spectral ratio (LH) and the cepstral/spectral index of dysphonia (CSID). Outcome measures included sensitivity, specificity and discrimination accuracy.

As individual acoustic measures, only spectral-based measures showed good (CPPadsv) and acceptable (CSID) discrimination results. The HNR, GNE and CPPp measures had acceptable sensitivity but poor or non-acceptable specificity and discrimination accuracy. Logistic regression models with all Praat measures (F0SD, HNR, GNE, CPPp) plus ADSV measures (CPPadsv, LH or CSID) provided excellent sensitivity, good-to-excellent specificity and excellent discrimination accuracy. ROC analysis for all individual measures showed that CPPadsv, CSID, CPPp, GNE and F0SD had the highest area under the curve (AUC) values.

A combination of acoustic measures that evaluate the major aspects of vocal dysfunction resulted in good-to-excellent voice discrimination outcomes. Individual acoustic measures had lower discrimination ability than combined measures. The findings implied that acoustic measures extracted from a prolonged vowel were useful in voice disorder discrimination.

What is already known on this subject
Acoustic measures hold great value in discriminating voice disorders from normal voices. However, no study has evaluated the discrimination value of a combination of sustained vowel acoustic measures that quantify additive noise, signal stability, signal periodicity, spectral slope and overall voice quality in single-gender cohorts. Previous studies have not used signal typing (the classification of the acoustic signals) for time-based measures, which affects the reliability of discrimination.

What this study adds to the existing knowledge
This study was the first to implement signal typing so that only sustained vowel samples with Type 1 and Type 2 signals were included in the discrimination statistics. We showed that a combination of vocal acoustic measures using time- and spectral-based extraction from the sustained /ɑ/ vowel, evaluating additive noise, signal stability, signal periodicity, spectral slope and overall voice quality, resulted in good-to-excellent sensitivity, specificity and discrimination accuracy. As individual measures, traditional time-based measures such as HNR had rather limited discrimination value, whilst spectral-based measures provided higher discrimination value. Measures that are sensitive to signal types have low discrimination ability.

What are the potential or actual clinical implications of this work?
The sustained vowel /ɑ/ is a relevant, universal vocal task for clinical application using acoustic measures to discriminate female speakers with and without voice disorders, provided signal typing is implemented. Clinical voice assessment using vowels may not be effective if it relies solely on time-based measurements. Spectral-based measures perform better in voice disorder discrimination given their insensitivity to signal types. The most effective voice disorder discrimination could only be obtained using a combination of acoustic measures that quantify major phenomena in the signals of disordered voices. Using measures extracted from both programs, Praat and ADSV, is useful given that specific settings in a program may affect discrimination accuracy.
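
The three outcome measures named in the abstract above (sensitivity, specificity and discrimination accuracy) can be illustrated with a minimal Python sketch. It assumes binary labels where 1 means "voice disorder" and 0 means "no voice disorder"; the function name `discrimination_metrics` and the toy labels are hypothetical and are not taken from the study, which reports its own results from logistic regression models.

```python
def discrimination_metrics(y_true, y_pred):
    """Return sensitivity, specificity and accuracy for binary labels.

    Assumption: 1 = voice disorder, 0 = no voice disorder, and both
    classes are present in y_true (otherwise a ratio is undefined).
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # disordered, flagged
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # disordered, missed
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # healthy, cleared
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # healthy, flagged
    return {
        "sensitivity": tp / (tp + fn),        # share of disordered voices detected
        "specificity": tn / (tn + fp),        # share of healthy voices cleared
        "accuracy": (tp + tn) / len(pairs),   # share of all speakers classified correctly
    }


# Made-up labels for illustration only (not data from the study).
y_true = [1, 1, 1, 1, 1, 0, 0, 0]
y_pred = [1, 1, 1, 1, 0, 0, 0, 1]
metrics = discrimination_metrics(y_true, y_pred)
print(metrics)
```

A measure can have acceptable sensitivity and still poor specificity, as the abstract reports for HNR, GNE and CPPp, because the two ratios are computed over different groups of speakers.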

This study investigated the impact of standardized mobile phone recordings passed through a telecom channel on acoustic markers of voice quality and on its perception by voice experts in normophonic speakers. Continuous speech and a sustained vowel were recorded for fourteen female and ten male normophonic speakers. The recordings were made simultaneously with a head-mounted high-quality microphone and through the telephone network on a receiving smartphone. Twenty-two acoustic voice quality, breathiness and pitch-related measures were extracted from the recordings. Nine vocologists perceptually rated the G, R and B parameters of the GRBAS scale on each voice sample. The reproducibility, the recording type, the stimulus type and the gender effects, as well as the correlation between acoustic and perceptual measures, were investigated.

The sustained vowel samples are damped after one second. Only the frequencies between 100 and 3700 Hz are passed through the telecom channel, and the frequency response is characterized by peaks and troughs. The acoustic measures show good reproducibility over the three repetitions. All measures differ significantly between the recording types, except for the local jitter, the harmonics-to-noise ratio by Dejonckere and Lebacq, the period standard deviation and all six pitch measures. The AVQI score is higher in telephone recordings, while the ABI score is lower. Significant differences between genders are also found for most of the measures; while the AVQI is similar in men and women, the ABI is higher in women in both recording types. For the perceptual assessment, the interrater agreement is rather low, while the reproducibility over the three repetitions is good. Few significant differences between recording types are observed, except for lower breathiness ratings on telephone recordings. G ratings are significantly more severe on the sustained vowel in both recording types, R ratings only on telephone recordings. While roughness is rated higher in men on telephone recordings by most experts, no gender effect is observed for breathiness in either recording type. Finally, neither the AVQI nor the ABI yields strong correlations with any of the perceptual parameters.

Our results show that passing a voice signal through a telecom channel induces filter and noise effects that limit the use of common acoustic voice quality measures and indexes. The AVQI and ABI are both significantly impacted by the recording type. The most reliable acoustic measures seem to be pitch perturbation (local jitter and period standard deviation) as well as the harmonics-to-noise ratio from Dejonckere and Lebacq. Our results also underline that raters are not equally sensitive to the various factors, including the recording type, the stimulus type and the gender effects. None of the three perceptual parameters G, R and B seems to be reliably measurable on telephone recordings using the two investigated acoustic indexes. Future studies investigating the impact of voice quality in telephone conversations should thus focus on acoustic measures on continuous speech samples that are limited to the frequency response of the telecom channel and that are not too sensitive to environmental and additive noise.
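
The two pitch perturbation measures singled out above, local jitter and period standard deviation, can be sketched in a few lines of Python. The sketch assumes the glottal periods have already been extracted from the recording (in seconds) and uses the common definition of local jitter as the mean absolute difference between consecutive periods divided by the mean period. The function names, the choice of the sample (rather than population) standard deviation and the example periods are assumptions for illustration, not the study's pipeline.

```python
from statistics import mean, stdev


def local_jitter(periods):
    """Mean absolute difference between consecutive periods,
    divided by the mean period (a unitless ratio)."""
    diffs = [abs(b - a) for a, b in zip(periods, periods[1:])]
    return mean(diffs) / mean(periods)


def period_sd(periods):
    """Sample standard deviation of the periods, in the same unit
    as the input. Assumption: sample (n - 1) rather than population SD."""
    return stdev(periods)


# Made-up glottal periods in seconds (around 200 Hz), for illustration only.
periods = [0.0050, 0.0051, 0.0049, 0.0050]
print(f"local jitter: {100 * local_jitter(periods):.2f} %")
print(f"period SD:    {1000 * period_sd(periods):.4f} ms")
```

Both measures depend only on the timing of successive cycles, which may help explain why they were less affected by the band-limited telecom channel than spectrum-based indexes.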

Articles published on Vowel Samples

Can acoustic measurements predict gender perception in the voice?

Validation of the Acoustic Voice Quality Index Version 03.01 in Turkish

Effectiveness of Temporal Auditory Skills Training Associated With Conventional Auditory Training in the Auditory-Perceptual Judgment of Voice: Preliminary Data

Cepstral Acoustic Measurements: Influence of Speech Task and Degree of Vocal Deviation

Evidence-Based Recommendations for Tablet Recordings From the Bridge2AI-Voice Acoustic Experiments

Usefulness of Direct Magnitude Estimation (DME) and Acoustic Analysis in Measuring Dysphonia Severity

Voice disorder discrimination using vowel acoustic measures in female speakers.

Effect of Face Masks on Voice Quality Associated with Young and Older Chinese Adult Speakers

Validity of Acoustic Measures Obtained Using Various Recording Methods Including Smartphones With and Without Headset Microphones.

Effects of Semi-occluded Vocal Tract Exercises with Different Vibration Sources and Duration on Healthy Adults’ Voice

Voice pathology identification system using a deep learning approach based on unique feature selection sets

Smartphone Recordings are Comparable to “Gold Standard” Recordings for Acoustic Measurements of Voice

Analysis of different performance times of the voiced trill technique in older women.

Análise dos tempos de execução da técnica de sons vibrantes em idosas

Different Performances of Machine Learning Models to Classify Dysphonic and Non-Dysphonic Voices

Automatic GRBAS Scoring of Pathological Voices using Deep Learning and a Small Set of Labeled Voice Data

The Mongolian Vowel Acoustic Model Based on the Clustering Algorithm.

Voice Quality in Telephone Interviews: A preliminary Acoustic Investigation

Detection of Glottic Neoplasm Based on Voice Signals Using Deep Neural Networks

The Effect of Microphone Frequency Response on Spectral and Cepstral Measures of Voice: An Examination of Low-Cost Electret Headset Microphones.
