Abstract

Researchers investigating the vocalic systems of languages or dialects frequently employ normalization methods to minimize between-speaker variability in formant patterns while largely preserving dialectal, between-phoneme variability. One popular method, log-mean normalization, relies on estimating the logarithm of the geometric mean formant-frequency (Gs) produced by a speaker across their vocalic inventory, and then expressing the formant patterns produced by a speaker as deviations from this mean. However, in the face of missing or unbalanced data, the traditional approach to calculating Gs for a speaker will usually lead to biased estimates, which will produce artificial asymmetries in the normalized vowel spaces of different speakers. An alternative method is proposed for the estimation of Gs based on a linear-regression framework, which avoids the biases associated with traditional estimation of Gs when data is unbalanced. The regression method to normalization is described, and simulations are carried out to compare the accuracy of Gs estimates via regression to other estimation methods. Results indicate that the proposed method is substantially more accurate than the traditional approach to estimating Gs in the face of missing or unbalanced data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.