This article describes automatic singing transcription (AST), the task of estimating a human-readable musical score of a sung melody, represented with quantized pitches and durations, from a given music audio signal. To this end, we propose a statistical method that estimates the musical score by quantizing a trajectory of vocal fundamental frequencies (F0s) in the time and frequency directions. Since vocal F0 trajectories deviate considerably from the pitches and onset times of the musical notes specified in a score, the local keys and rhythms of musical notes should be taken into account. We therefore propose a Bayesian hierarchical hidden semi-Markov model (HHSMM) that integrates a musical score model describing the local keys and rhythms of musical notes with an F0 trajectory model describing the temporal and frequency deviations of an F0 trajectory. Given an F0 trajectory, a sequence of musical notes, a sequence of local keys, and the temporal and frequency deviations can be estimated jointly with a Markov chain Monte Carlo (MCMC) method. We investigated the effect of each component of the proposed model and showed that the musical score model improves the performance of AST.
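To make the quantization idea concrete, below is a minimal sketch, not the paper's actual HHSMM: it decodes semitone pitches from an F0 trajectory with a plain HMM and Viterbi point estimation instead of Bayesian MCMC inference. A Gaussian emission stands in for the frequency-deviation model, and an interval-penalized transition stands in for the key-aware musical score model; the function name `transcribe_notes` and the parameters `sigma` and `stay_prob` are illustrative assumptions, not from the paper.

```python
import numpy as np

def f0_to_midi(f0_hz):
    """Convert F0 values in Hz to continuous MIDI pitch numbers."""
    return 69.0 + 12.0 * np.log2(np.asarray(f0_hz) / 440.0)

def transcribe_notes(f0_hz, midi_lo=48, midi_hi=84, sigma=0.5, stay_prob=0.98):
    """Viterbi decoding of quantized pitches from an F0 trajectory.

    Emission: Gaussian deviation (in semitones) of the observed F0 around
    each candidate pitch -- a crude stand-in for the frequency-deviation model.
    Transition: strong self-transition plus an interval prior favoring small
    pitch steps -- a crude stand-in for the key-aware score model.
    """
    obs = f0_to_midi(f0_hz)                      # (T,) observed pitches
    pitches = np.arange(midi_lo, midi_hi + 1)    # (S,) candidate note pitches
    S, T = len(pitches), len(obs)

    # Log-emission probabilities: Gaussian in the semitone domain.
    log_emit = -0.5 * ((obs[None, :] - pitches[:, None]) / sigma) ** 2

    # Log-transition matrix: self-transition vs. interval-penalized jump.
    interval = np.abs(pitches[:, None] - pitches[None, :])
    jump = np.where(interval > 0, np.exp(-0.5 * interval), 0.0)
    jump = jump / jump.sum(axis=1, keepdims=True) * (1.0 - stay_prob)
    log_trans = np.log(jump + np.eye(S) * stay_prob)

    # Standard Viterbi recursion with backpointers.
    delta = log_emit[:, 0].copy()
    back = np.zeros((T, S), dtype=int)
    for t in range(1, T):
        scores = delta[:, None] + log_trans      # (prev_state, next_state)
        back[t] = scores.argmax(axis=0)
        delta = scores.max(axis=0) + log_emit[:, t]

    # Backtrace the best pitch path.
    path = np.zeros(T, dtype=int)
    path[-1] = delta.argmax()
    for t in range(T - 1, 0, -1):
        path[t - 1] = back[t, path[t]]
    return pitches[path]

# Toy usage: a noisy step from C4 (261.6 Hz) to E4 (329.6 Hz).
rng = np.random.default_rng(0)
f0 = np.concatenate([np.full(50, 261.6), np.full(50, 329.6)])
f0 *= 2 ** (rng.normal(0.0, 0.3, size=100) / 12)  # +-0.3 semitone jitter
print(transcribe_notes(f0)[::10])                 # expect mostly 60s, then 64s
```

The self-transition probability plays the role of a duration model here; the paper's semi-Markov formulation instead models note durations explicitly, which is what lets it recover quantized rhythms as well as pitches.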