Abstract
Commonly used robust speaker verification systems are based on time-varying autoregressive spectral estimation (AR) combined with hidden Markov modeling (HMM) or dynamic time warping (DTW). An exhaustive optimization of these methods in the past has culminated in quite reliable verification schemes. It seems unlikely, though, that further significant improvements are readily obtained along the same path. Unlike time-varying AR-modeling, which focuses on the the global spectral structure of an utterance, we are introducing a new method that focuses on the local time-varying spectral structure of individual pitch periods. Additionally, a pattern classification method using singular value decomposition (SVD) is employed. The new method by itself does not deliver better results than commonly used global methods; however, it is shown that an acceptance/rejection decision derived from both global and local analysis greatly improves the performance of the verification system.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.