Speaker identification by difference sum and correlation coefficients of narrow-band spectrum

Byunggon Yang,Sunmee Kang

doi:10.1121/1.4780185

Abstract

Some problems in speaker identification procedures were examined: transformation of acoustic parameters into auditory scales, invalid measurement values, and comparability of spectral energy values across the frequency range. To resolve those problems, the acoustic spectral energy of three Korean numbers produced by ten female students from narrow-band spectrograms at 19 proportional time points of each voiced segment were analyzed. Then, cells of the first five spectral matrices were averaged to form a matrix model for each speaker. The correlation coefficients and sum of the absolute amplitude difference in each pair of the spectral models of the ten subjects were obtained. Also, some individual matrix models were compared to those of the same subject or the other subject with a similar spectral model. Results showed that in numbers ‘‘2’’ and ‘‘9’’ subjects could not be clearly distinguished from the others but in number ‘‘4’’ it shed some possibility of setting threshold values for speaker identification if the coefficients and the sum of absolute difference were employed. Further studies would be desirable on various combinations of the range of long-term average spectra and the degree of signal pre-emphasis. [Work supported by grant No. R01-1999-000-00229-0 from the Korea Science & Engineering Foundation.]

Full Text