Abstract

The constant-Q cepstral coefficients (CQCC) feature is one of the most effective feature in the field of spoof speech detection. The extraction of this feature involves constant-Q transform that captures long range information from the signal. It is followed by uniform resampling of the octave power spectrum to have linear power spectrum from which the CQCC features are obtained. However, we hypothesize that the information obtained from octave power spectrum is complementary with that captured by the linear spectrum. In this regard, we propose to combine the coefficients generated using both linear and octave power spectrum. The combined feature is referred to as extended CQCC (eCQCC) which is hypothesized to have better discriminative information for detection of spoof attacks. The studies for spoof detection are conducted on both synthetic voice and replay based spoofing attacks using ASVspoof 2015 and ASVspoof 2017 Version 2.0 database, respectively. The studies confirm that the proposed eCQCC feature consistently outperforms the baseline CQCC feature in all tasks.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.