Abstract

Automatic speaker gender identification based on the speech feature has important application in the audio processing and analysis field. In order to overcome the conventional linear parameters in the speaker feature lack of gender characteristics, in this paper, nonlinear parameters such as the fractal dimension and fractal complexity as feature space effective compensations are presented. Firstly, use lifting scheme to extract pitch; Then extract the speech fractal dimension; Finally, according Takens theorem, time delay method is used to reconstruct phase space of fractal dimension sequence, fractal dimension complexity is obtained by calculating Approximate Entropy. Three dimension feature vectors constructed by the pitch, the fractal dimension and the fractal dimension complexity are applied to speaker gender identification. Results show as the identification system based on the new method introduces the non-linear parameters, its accuracy and stability are effectively improved compared with the traditional linear method identification systems. The new nonlinear method provides new ideas for speaker gender identification of a new line of thought.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call