Robust feature extraction for speaker recognition based on constrained nonnegative tensor factorization

Qiang Wu ,Liqing Zhang ,Guangchuan Shi

doi:10.1007/s11390-010-1061-z

Qiang Wu , Liqing Zhang + Show 1 more

https://doi.org/10.1007/s11390-010-1061-z

Copy DOI

Export

Save

Cite

Abstract
Full-Text
Similar Papers

Abstract

Listen

How to extract robust feature is an important research topic in machine learning community. In this paper, we investigate robust feature extraction for speech signal based on tensor structure and develop a new method called constrained Nonnegative Tensor Factorization (cNTF). A novel feature extraction framework based on the cortical representation in primary auditory cortex (AI) is proposed for robust speaker recognition. Motivated by the neural firing rates model in AI, the speech signal first is represented as a general higher order tensor. cNTF is used to learn the basis functions from multiple interrelated feature subspaces and find a robust sparse representation for speech signal. Computer simulations are given to evaluate the performance of our method and comparisons with existing speaker recognition methods are also provided. The experimental results demonstrate that the proposed method achieves higher recognition accuracy in noisy environment.

Full Text