Graduate School of Information Science and Technology, Hokkaido University,Kita 14 Nishi 9, Sapporo, 060–0814 Japan(Received 21 December 2004, Accepted for publication 7 February 2005)Keywords: Delayed auditory feedback, Transformed auditory feedback, Speech motor control, StutteringPACS number: 43.70. h [DOI: 10.1250/ast.26.358]1. IntroductionSpeech perception plays an important role in speechproduction. When we speak, we always control our voicethrough speech perception in order to adapt our speaking tounpredictable environmental changes. For example, accordingto the Lombard effect [1], pitch or loudness increases under anoisy condition. An artificial change such as a delayedauditory feedback (DAF) or a transformed auditory feedback(TAF) experiments also gives rise to various behaviors ofsubjects. It is known that in the DAF experiment, wherespeech is delayed for a short time (50–300ms) before it isreplayed to the speaker, normal subjects fall into a stuttering-like condition [2]. However, in the case of stutterers,stuttering is suppressed with DAF. In the TAF experiment,where the pitch of the feedback voice is shifted, the pitchfrequency is feedback-controlled with fast (’150ms) andslow ( 300ms) responses [3–5]. Stutterers also demonstratedifferent performance from normal subjects [6]. Thesefindings indicate that complex feedback controls, includingmultiple processing paths, play a part in speech motor controlthrough speech perception.A phenomenon called the ‘‘chorus effect,’’ which isrelated to stuttering, is known [7]. It is a phenomenon inwhich stuttering is suppressed when the stutterer speaks thesame sentences in synchrony with others. The mechanism ofthis effect is not known, however, there are some possibleexplanations for this effect. One is that the brain of stutterersforms a specific structure in the feedback control ofarticulation that does not exist in normal subjects, and it iseffective only under the condition described above. A secondsimpler explanation which is not limited to stutterers is that, ingeneral, the feedback control of articulation is activated onlyon one’s own voice; it does not function when other’s voice isinput to the auditory system. As a result, stutterers who havesome problem in their articulation feedback control system donot fall into stuttering when other’s voice is given. In the lattercase, we can predict that when a parameter of the feedbackvoice is artificially shifted sufficiently to change one’sperception of the voice, the feedback control of articulationwill be suppressed. One of the parameters is pitch. It is knownthat a frequency altered feedback (FAF) medical treatment,where the pitch of the feedback voice is shifted for stutterers,has an inhibitory effect on stuttering [8].Up to now, control models of pitch [4,5] or brainprocessing models of speech [9] have been proposed fromthe viewpoint of speech motor control. However, brainprocessing that explains the chorus effect, such as the twopossibilities above, has not been sufficiently discussed orexamined. The assumption that feedback control of articu-lation acts only on one’s own voice suggests the existence ofanother process of speech motor control that has not beenconsidered until now. Consequently, we report the results ofan experiment in which DAF and TAF were combined toexamine the nature of the interaction between articulationcontrol and pitch-shift perception. Following that, we discussthe mechanism of feedback control on articulation.2. ExperimentIt has been reported from conventional DAF experimentsthat DAF is most effective at delays under 200ms. However,if the feedback control of articulation is suppressed in propor-tion to the pitch shift, the DAF effect will be reduced inproportion to the pitch shift, independently of the delay time.2.1. Experimental methodEight healthy, male native Japanese speakers (age range22–26 years, mean SD 23:5 1:9) participated as subjects.They had no speech or voice disorders, and were not trainedas singers.Each subject’s voice was recorded with a microphone(SHURE SM58). The delay and pitch were converted throughan effector (ZOOM RFX-2000) and fed back to the subjectsthrough headphones (AKG K271S). The parameters of pitchand delay were manipulated using a PC (SONY VAIO PCV-RX72K) through MIDI. The voices were recorded on a DAT(SONY DTC-ZA5ES). Pink noise at 75dB SPL was gen-erated by an analyzer (PHONIC PAA-2) and mixed with thefeedback voice by a mixer (PHONIC PM602FX).Each subject was instructed to read a Japanese junior highschool language textbook. Two conditions were prepared.Under the normal condition, subjects read the textbook aloudwhile listening to an unaltered feedback voice throughheadphones. Under the altered condition, they read the sametext aloud while listening to an altered voice. Under thealtered condition, the sentences were divided into blocks, andeach block was allocated different delay-time and pitch-shift
Read full abstract