The standard autocorrelation (AC) model of pitch perception is capable of predicting a wide range of pitch phenomena, but there are a number of pitch-shift effects that it fails to predict. One of these is the pitch shift associated with a single mistuned harmonic, another noise edge pitch (NEP). A recent extension of the model by Hartmann et al. [J. Acoust. Soc. Am. 145, 1993–2008 (2019)] employs a method that considers multiple peaks in the AC function, enabling the prediction of NEP. This extension, however, reduces the ability of the model to predict mistuned harmonic pitch. It can be shown that an additional extension to the AC model, which modifies the peak selection process, is capable of predicting both types of pitch shift. Specifically, a delay time is assigned to a peak of the AC function in a manner that takes into account the entire shape of the bump surrounding a local maximum. The end result is closely related to a template fitting procedure in the frequency domain, but it is suggested that such a procedure could correspond to processes in the time domain. The neuronal dynamics that support synchrony enhancement are suggested as a possible mechanism.
Read full abstract