It has long been established that LPC analysis results in formant estimates that are not accurate representations of the resonances; they are biased towards the nearest harmonic, and this bias worsens as F0 rises to 200 Hz or more. Manual measurement of formants with the reassigned spectrogram (RS) has been shown to be more accurate, but the ground truth is needed in order to test automatic measurement methods. Here, “vocal tract” models were 3D-printed to allow resonances to be excited replicably. Using the principle of acoustic reciprocity, source signals were played over an external loudspeaker, and the output filtered by the model was recorded via a microphone located in the model’s “glottis.” A white noise source signal was used to determine the resonances. Sawtooth and impulse train source signals (F0s 83–400 Hz) were filtered by the physical model and then analyzed with both LPC and RS. LPC formants were biased towards the nearest harmonic, resulting in errors from 10 Hz at low F0 to 250 at 300 Hz. RS errors ranged from 0 to 20 Hz and were not correlated with F0. These results indicate physical models are useful for creating known resonances for validating accuracy in our measurements.
Read full abstract