Unmodified Speech Research Articles

The task of speaker recognition is feasible when the speakers are co-operative or wish to be recognized. While modern automatic speaker verification (ASV) systems and some listeners are good at recognizing speakers from modal, unmodified speech, the task becomes notoriously difficult in situations of deliberate voice disguise when the speaker aims at masking his or her identity. We approach voice disguise from the perspective of acoustical and perceptual analysis using a self-collected corpus of 60 native Finnish speakers (31 female, 29 male) producing utterances in normal, intended young and intended old voice modes. The normal voices form a starting point and we are interested in studying how the two disguise modes impact the acoustical parameters and perceptual speaker similarity judgments.First, we study the effect of disguise as a relative change in fundamental frequency (F0) and formant frequencies (F1 to F4) from modal to disguised utterances. Next, we investigate whether or not speaker comparisons that are deemed easy or difficult by a modern ASV system have a similar difficulty level for the human listeners. Further, we study affecting factors from listener-related self-reported information that may explain a particular listener’s success or failure in speaker similarity assessment.Our acoustic analysis reveals a systematic increase in relative change in mean F0 for the intended young voices while for the intended old voices, the relative change is less prominent in most cases. Concerning the formants F1 through F4, 29% (for male) and 30% (for female) of the utterances did not exhibit a significant change in any formant value, while the remaining ∼ 70% of utterances had significant changes in at least one formant.Our listening panel consists of 70 listeners, 32 native and 38 non-native, who listened to 24 utterance pairs selected using rankings produced by an ASV system. The results indicate that speaker pairs categorized as easy by our ASV system were also easy for the average listener. Similarly, the listeners made more errors in the difficult trials. The listening results indicate that target (same speaker) trials were more difficult for the non-native group, while the performance for the non-target pairs was similar for both native and non-native groups.

Read full abstract

Dyslexic children exhibit great difficulties in acquiring reading skills, despite adequate intelligence and instruction, and in the absence of any obvious neurological or sensory disorders. Both phonological and surface dyslexics are impaired in phonological skills (Sprenger-Charolles, Col! e, Lacert, & Serniclaes, 2000), but the origin of these disabilities is controversial. According to the ‘rapid processing hypothesis’, phonological impairments in dyslexia stem from an auditory deficit in the processing of brief and/or rapidly changing acoustic events, which compromises phoneme discrimination, and the acquisition of metaphonological skills and grapheme–phoneme correspondence rules (Nagarajan et al., 1999; Tallal, 1980). According to the ‘linguistic hypothesis’, auditory deficits and language disorders may be associated but are not causally related (Cornelissen, Hansen, Hutton, Evangelinou, & Stein, 1998; Nittrouer, 1999; Rosen, 2003). The evidence for this view is twofold. First, many studies demonstrated the existence of a speech-specific impairment in dyslexia (Mody, Studdert-Kennedy, & Brady, 1997; Rosen & Manganari, 2001). Second, the processing of short and/or rapidly varying acoustic signals may not be the fundamental problem in dyslexia (Bradlow et al., 1999). Intensive training with artificially slowed speech did not improve reading and phonemic awareness, compared with training on unmodified speech (Rey, De Martino, Espesser, & Habib, 2002). Whatever the origin of linguistic disorders in dyslexic children, it is generally agreed that the core deficit is phonological. In this paper, we ask which phonological mechanisms and which aspects of phonological knowledge are impaired in dyslexia. We assume that this impairment relates to the phonetic underpinnings of phonemic knowledge. This hypothesis is supported by recent experiments on the effect of phonetic similarity in reading. ARTICLE IN PRESS

Read full abstract

Unmodified Speech Research Articles

Related Topics

Articles published on Unmodified Speech

The impact of speech type on listening effort and intelligibility for native and non-native listeners.

Dialect and gender perception in relation to the intelligibility of low-pass and high-pass filtered spontaneous speecha).

Combining spectral and temporal modification techniques for speech intelligibility enhancement

Acoustical and perceptual study of voice disguise by age modification in speaker verification

日本語とブラジル・ポルトガル語の外国語訛り : 加工音声と原音声の知覚的評価( 正常な発話と逸脱した発話)

Effects of linear and nonlinear speech rate changes on speech intelligibility in stationary and fluctuating maskers

Evaluating the intelligibility benefit of speech modifications in known noise conditions

Modified Spectral Tilt Affects Older, but Not Younger, Infants' Native-Language Fricative Discrimination

Applied principles of clear and Lombard speech for automated intelligibility enhancement in noisy environments

Reading & Writing

Effectiveness of computerised spelling training in children with language impairments: a comparison of modified and unmodified speech input

Physiological assessment of contrast-enhancing frequency shaping and multiband compression in hearing aids

Speech Intelligibility During Respirator Wear: Influences of Respirator Speech Diaphragm Size and Background Noise

Sensitivity to voicing similarity in printed stimuli: effect of a training programme in dyslexic children

Comparative Loudness of Highpass Filtered Speech

Discrimination of filtered-clipped speech by hearing-impaired subjects.

Preprocessing of an Already Noisy Speech Signal for Intelligibility Enhancement

Discrimination of Filtered-Clipped Speech by Sensorineural Hearing-Impaired Subjects

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Unmodified Speech Research Articles

Related Topics

Articles published on Unmodified Speech

The impact of speech type on listening effort and intelligibility for native and non-native listeners.

Dialect and gender perception in relation to the intelligibility of low-pass and high-pass filtered spontaneous speecha).

Combining spectral and temporal modification techniques for speech intelligibility enhancement

Acoustical and perceptual study of voice disguise by age modification in speaker verification

日本語とブラジル・ポルトガル語の外国語訛り : 加工音声と原音声の知覚的評価( 正常な発話と逸脱した発話)

Effects of linear and nonlinear speech rate changes on speech intelligibility in stationary and fluctuating maskers

Evaluating the intelligibility benefit of speech modifications in known noise conditions

Modified Spectral Tilt Affects Older, but Not Younger, Infants' Native-Language Fricative Discrimination

Applied principles of clear and Lombard speech for automated intelligibility enhancement in noisy environments

Reading &amp; Writing

Effectiveness of computerised spelling training in children with language impairments: a comparison of modified and unmodified speech input

Physiological assessment of contrast-enhancing frequency shaping and multiband compression in hearing aids

Speech Intelligibility During Respirator Wear: Influences of Respirator Speech Diaphragm Size and Background Noise

Sensitivity to voicing similarity in printed stimuli: effect of a training programme in dyslexic children

Comparative Loudness of Highpass Filtered Speech

Discrimination of filtered-clipped speech by hearing-impaired subjects.

Preprocessing of an Already Noisy Speech Signal for Intelligibility Enhancement

Discrimination of Filtered-Clipped Speech by Sensorineural Hearing-Impaired Subjects

Reading & Writing