Speech Fundamental Frequency Research Articles

Background and objectiveThis article presents a multimodal analysis of startle type responses using a variety of physiological, facial, and speech features. These multimodal components of the startle type response reflect complex brain–body reactions to a sudden and intense stimulus. Additionally, the proposed multimodal evaluation of reflexive and emotional reactions associated with the startle eliciting stimuli and underlying neural networks and pathways could be applied in diagnostics of different psychiatric and neurological diseases. Different startle type stimuli can be compared in the strength of their elicitation of startle responses, i.e. their potential to activate stress-related neural pathways, underlying biomarkers and corresponding behavioral reactions. MethodsAn innovative method for measuring startle type responses using multimodal stimuli and multimodal feature analysis has been introduced. Individual's multimodal reflexive and emotional expressions during startle type elicitation have been assessed by corresponding physiological, speech and facial features on ten female students of psychology. Different startle eliciting stimuli like noise and airblast probes, as well as a variety of visual and auditory stimuli of different valence and arousal levels, based on International Affective Picture System (IAPS) images and/or sounds from International Affective Digitized Sounds (IADS) database, have been designed and tested. Combined together into more complex startle type stimuli, such composite stimuli can potentiate the evoked response of underlying neural networks, and corresponding neurotransmitters and neuromodulators as well; this is referred to as increased power of response elicitation. The intensity and magnitude of multimodal responses to selected startle type stimuli have been analyzed using effect sizes and medians of dominant multimodal features, i.e. skin conductance, eye blink, head movement, speech fundamental frequency and energy. The significance of the observed effects and comparisons between paradigms were evaluated using one-tailed t-tests and ANOVA methods, respectively. Skin conductance response habituation was analyzed using ANOVA and post hoc multiple comparison tests with the Dunn–Šidák correction. ResultsThe results revealed specific physiological, facial and vocal reflexive and emotional responses on selected five stimuli paradigms which included: (1) acoustic startle probes, (2) airblasts, (3) IAPS images, (4) IADS sounds, and (5) image-sound-airblast composite stimuli. Overall, composite and airblast paradigms resulted in the largest responses across all analyzed features, followed by sound and acoustic startle paradigms, while paradigm using images consistently elicited the smallest responses. In this context, power of response elicitation of the selected stimuli paradigms can be described according to the aggregated magnitude of the participants’ multimodal responses. We also observed a habituation effect only in skin conductance response to acoustic startle, airblast and sound paradigms. ConclusionsThis study developed a system for paradigm design and stimuli generation, as well as real-time multimodal signal processing and feature calculation. Experimental paradigms for monitoring individual responses to stressful startle type stimuli were designed in order to compare the response elicitation power across various stimuli. The developed system, applied paradigms and obtained results might be useful in further research for evaluation of individuals’ multimodal responses when they are faced with a variety of aversive emotional distractors and stressful situations.

Read full abstract

To the Editor. Speech changes with age, affecting quality of life1,2. Underlying degenerative processes include laryngeal neuromuscular degeneration through atrophy and dystrophy, and edema in the vocal fold cover3–6. Because voice production structures share physiological territory with the aerodigestive tract, age-related degeneration of the voice may coincide with degeneration of other key functions such as breathing, swallowing, and airway protection. Historically, age-related voice studies have been cross-sectional in nature, identifying age-related vocal characteristics by comparing an elderly subject group to a younger group. Although the use of subject groups provides general trends, longitudinal case studies may provide additional insights by tracking the progression of voice, swallowing and breathing characteristics with age without the effects of inter-subject statistical averaging and variability. The current case study uses 50 years (1958–2007, 48–98 y/o) of speech recordings. The subject is a male lay leader of an international church. In addition to the unique longitudinal breadth of his speeches, this subject and his body of speeches are unique because (1) he received no training as a public speaker and used none of the traditional rhetorical characteristics of sermons; (2) he avoided smoking, coffee, and alcohol, common vocal irritants that might obfuscate age-specific changes to the voice; (3) the acoustical environment were consistent, one of two multi-purpose university arenas; and (4) all of the speeches were long enough to provide a sustained representative voice sample for analysis. Two types of analyses were employed: speech fundamental frequency to reveal the current health of the laryngeal physiology, as well as length of speech breath groups to indicate efficiency of laryngeal valving and/or lung vital capacity. Overall, the subject’s voice changed significantly in the mid to latter part of the sixth decade (Figure 1), which could be traced to age-related physiological processes. Generally, speech fundamental frequency decreased until about age 68 (Figure 1a). From age 68 to 98 years, average pitch increased from 140 to 160 Hz and the range (inter-quartile range) decreased by 20 percent. Because speech fundamental frequency depends on the physiology of the vocal folds and control of the musculature of the larynx, changes in mean and range may suggest a deterioration of the state of the tissue and general motor control with age. For example, age-related loss of mass of itself would increase the average speech fundamental frequency; however, decreased mass in the vocal folds could cause the vocal folds to begin to bow7. Further, if the subject adjusted for the bowing by increasing the stretch of the vocal fold to assist with glottal closure during phonation, this would also raise average speech fundamental frequency. Figure 1 (a) Speaking fundamental frequency changes over a lifetime: mode, mean, and median. (b) Average (diamonds) number of words per breath group and standard deviation (squares) of words per breath group, as counted by a reviewer. Solid filled symbols represent ... Changes in speech fundamental frequency corresponded with a reduction of speech breathing length. The subject increased the number of breath groups per minute (6.3% per decade), losing about 6–6.5 percent of speech breath group length per decade (Figure 1b). This change was almost imperceptible until the sixth decade. Simultaneously, the standard deviation of words per breath group decreased nearly linearly throughout the observation period. Thus, the subject could not sustain the same number of words in a breath group and needed to breathe more frequently while speaking. This change might have been caused by (1) a less flexible rib cage and the loss of vital capacity; or (2) increased glottal chink or bowing of the vocal folds8, resulting in more air leakage during speaking and reduce the air available. It is possible the results were affected because variations of recording environment, recording equipment and compression of the audio were not controlled. Nevertheless, the effects were likely minimal because (1) the venues and communication context were similar; (2) the metrics used are less sensitive to these variabilities; and (3) the results were similar to other reports in the literature. Further, while the longitudinal breadth of the study period makes these results valuable, they are nevertheless preliminary because only one subject was examined. Systemic neuromuscular changes can be inferred from changes in speech fundamental frequency and speech breathing. Other changes, such as increased risk of dysphagia (the inability to swallow safely and efficiently), may also correlate with these changes. Additional studies may identify indicators of when further assessments and treatments of age-related changes (e.g., dysphagia, dysphonia) are needed, or when preventative exercise may assist in slowing age indicators9, 10. Future longitudinal studies using more subjects (both genders) may further understanding of normal changes due to aging versus pathology. However, such a corpus of recordings must first be filtered based on communicative intent, venues, knowledge of vocal coaching and related information.

Read full abstract

Speech Fundamental Frequency Research Articles

Related Topics

Articles published on Speech Fundamental Frequency

Pitch segmentation of speech signals based on short-time energy waveform

Characterizing resonant component in speech: A different view of tracking fundamental frequency

Change of speech fundamental frequency explains the satisfaction with voice in response to testosterone therapy in female-to-male gender dysphoric individuals.

Multimodal analysis of startle type responses

THE EFFECTS OF FAMILIAR, UNFAMILIAR MUSIC AND AUDIOBOOKS EXPOSURE ON SPEECH PARAMETERS OF ELDERLY WITHALZHEIMER’S DISEASE: A WITHIN CASE STUDIES

Speech production in the later years: Changes in fundamental frequency and speech breathing

Sex Differences in Pitch Range and Speech Fundamental Frequency After Arytenoid Adduction and Thyroplasty

Aspects of Resonance: Comparison of High Speed Films and Overtone Measurements

A Precision, Large Scale, Anti-Noise Method for Correction of Speech Fundamental Frequency

Increasing diversity of neural responses to speech sounds across the central auditory pathway

Speech tempo and fundamental frequency patterns: A case study of male monozygotic twins and an age- and sex-matched sibling

Age and Speech Production: A 50‐Year Longitudinal Study

Efeitos imediatos do exercício vocal sopro e som agudo

Device and Method for Improving Communication Through Dichotic Input of a Speech Signal

Evaluation of fundamental frequency in individuals with normal voice and those with vocal nodules

Evaluation of speech fundamental frequency and its range in normal farsi-speaking people of both sexes and across different ages via reading a passage

Detection of Dynamic Structures of Speech Fundamental Frequency in Tonal Languages

B-Spline Model Order Selection With Optimal MDL Criterion Applied to Speech Fundamental Frequency Stylization

Prosodic peak estimation under segmental perturbations

Effect of the fundamental frequency and vocal register on the voice pitch compensation.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Speech Fundamental Frequency Research Articles

Related Topics

Articles published on Speech Fundamental Frequency

Pitch segmentation of speech signals based on short-time energy waveform

Characterizing resonant component in speech: A different view of tracking fundamental frequency

Change of speech fundamental frequency explains the satisfaction with voice in response to testosterone therapy in female-to-male gender dysphoric individuals.

Multimodal analysis of startle type responses

THE EFFECTS OF FAMILIAR, UNFAMILIAR MUSIC AND AUDIOBOOKS EXPOSURE ON SPEECH PARAMETERS OF ELDERLY WITHALZHEIMER’S DISEASE: A WITHIN CASE STUDIES

Speech production in the later years: Changes in fundamental frequency and speech breathing

Sex Differences in Pitch Range and Speech Fundamental Frequency After Arytenoid Adduction and Thyroplasty

Aspects of Resonance: Comparison of High Speed Films and Overtone Measurements

A Precision, Large Scale, Anti-Noise Method for Correction of Speech Fundamental Frequency

Increasing diversity of neural responses to speech sounds across the central auditory pathway

Speech tempo and fundamental frequency patterns: A case study of male monozygotic twins and an age- and sex-matched sibling

Age and Speech Production: A 50‐Year Longitudinal Study

Efeitos imediatos do exercício vocal sopro e som agudo

Device and Method for Improving Communication Through Dichotic Input of a Speech Signal

Evaluation of fundamental frequency in individuals with normal voice and those with vocal nodules

Evaluation of speech fundamental frequency and its range in normal farsi-speaking people of both sexes and across different ages via reading a passage

Detection of Dynamic Structures of Speech Fundamental Frequency in Tonal Languages

B-Spline Model Order Selection With Optimal MDL Criterion Applied to Speech Fundamental Frequency Stylization

Prosodic peak estimation under segmental perturbations

Effect of the fundamental frequency and vocal register on the voice pitch compensation.