We use a classification tool previously tested on Romanian fricatives to categorize the front (non-sibilant) fricatives of English by place of articulation. Labiodental and interdental fricatives are difficult to distinguish acoustically, posing problems even for human perception. Prior classification work with English front fricatives has not been very successful with this contrast, with correct classification rates ranging from 40 to 60%. The feature set we use for coding the acoustic properties of the fricatives and their following vowels comprise the first six cepstral coefficients (c0–c5). The acoustic features are measured at 10-ms intervals across each segment; the measures obtained are then binned into three contiguous intervals for both the fricative and the vowel, representing the onset, steady state, and offset of each segment. The boundaries between regions are set by using a hidden Markov model to determine three internally uniform regions with respect to their acoustic properties. The mean value of each acoustic feature within each region is obtained; thus, each CV production yields six measurements per coefficient. We are testing this model on fricatives from the TIMIT corpus and classifying them using multinomial logistic regression models. While this investigation is underway, we expect higher correct classification rates than previous work.
Read full abstract