Abstract

Some problems in speaker identification procedures were examined: transformation of acoustic parameters into auditory scales, invalid measurement values, and comparability of spectral energy values across the frequency range. To resolve those problems, the acoustic spectral energy of three Korean numbers produced by ten female students from narrow-band spectrograms at 19 proportional time points of each voiced segment were analyzed. Then, cells of the first five spectral matrices were averaged to form a matrix model for each speaker. The correlation coefficients and sum of the absolute amplitude difference in each pair of the spectral models of the ten subjects were obtained. Also, some individual matrix models were compared to those of the same subject or the other subject with a similar spectral model. Results showed that in numbers ‘‘2’’ and ‘‘9’’ subjects could not be clearly distinguished from the others but in number ‘‘4’’ it shed some possibility of setting threshold values for speaker identification if the coefficients and the sum of absolute difference were employed. Further studies would be desirable on various combinations of the range of long-term average spectra and the degree of signal pre-emphasis. [Work supported by grant No. R01-1999-000-00229-0 from the Korea Science & Engineering Foundation.]

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.