Abstract

Speaker recognition systems mostly employ mel frequency cepstral coefficients (MFCC). Performance of these systems is generally affected by background noise, transmission medium, etc. Further, they do not perform well in text‐independent environment with limited training data. For enhanced performance, the set of parameters used should separate the speaker‐dependent information from the linguistic information. Towards this end, application of parameters of the harmonic plus noise model (HNM)‐based analysis is investigated. HNM divides the speech spectrum into harmonic and noise bands separated by a dynamically varying maximum voiced frequency. This frequency is a speaker‐dependent parameter and its estimation is not affected by moderate SNR degradation. A speaker recognition system was devised using three HNM parameters, namely, maximum voiced frequency, relative noise band energy, and pitch. It gave‐performed comparably to that of MFCC‐based recognition, for a group of ten speakers. An enhanced performan...

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.