Vandermonde transform using the orthogonal basis is a time-frequency transform method that can describe a speech and audio signal to be uncorrelated and sparse, and can be considered to be also effective in speech processing. In particular, when applied to the residual considered to be effective in a pitch estimation of speech and speaker recognition. On the other hand, we have already proposed Time-Varying Complex AR (TV-CAR) speech analysis for an analytic signal. The TV-CAR analysis can improve spectral estimation accuracy in the low frequencies and it can estimate the complex AR coefficients for each sample. As a result, in the estimated complex residual signal, formant components are more subtracted from speech signal. It can perform better on pitch estimation, or so on. In this paper, we evaluated the time-frequency analysis using the Vandermonde transform based on an Orthogonal Matching Pursuit (OMP) method of the literature using the complex AR residual. The performance is evaluated for several orthogonal basis such as Fourier basis, Cos basis, K-L transform, and Gabor basis. Furthermore, the Ramanujan-sum is also introduced as the basis function to realize time-frequency transform.
Read full abstract