Voice source characterization using pitch synchronous discrete cosine transform for speaker identification.

A G Ramakrishnan,B Abhiram,S R Mahadeva Prasanna

doi:10.1121/1.4921679

Voice source characterization using pitch synchronous discrete cosine transform for speaker identification.

A G Ramakrishnan, B Abhiram + Show 1 more

Open Access

https://doi.org/10.1121/1.4921679

Copy DOI

Journal: The Journal of The Acoustical Society of America	Publication Date: May 28, 2015
Citations: 24

Affiliation: Indian Institute of Science Bangalore, Indian Institute of Technology Guwahati

#Voice Source #Pitch Synchronous + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

A characterization of the voice source (VS) signal by the pitch synchronous (PS) discrete cosine transform (DCT) is proposed. With the integrated linear prediction residual (ILPR) as the VS estimate, the PS DCT of the ILPR is evaluated as a feature vector for speaker identification (SID). On TIMIT and YOHO databases, using a Gaussian mixture model (GMM)-based classifier, it performs on par with existing VS-based features. On the NIST 2003 database, fusion with a GMM-based classifier using MFCC features improves the identification accuracy by 12% in absolute terms, proving that the proposed characterization has good promise as a feature for SID studies.

Full Text