Voice Presentation Attacks Detection using Acoustic MLTP Features and BiLSTM

Sundas Ibrar,Ali Javed,Hafsa Ilyas

doi:10.1109/c-code58145.2023.10139903

Sundas Ibrar, Ali Javed + Show 1 more

https://doi.org/10.1109/c-code58145.2023.10139903

Copy DOI

Export

Save

Cite

Abstract
Full-Text
Similar Papers

Abstract

Listen

Automatic speaker verification (ASV) systems have been used in numerous voice-controlled technologies including voice banking, automobiles, smartphones, smart homes, etc. The growth of voice-operated devices is increasing exponentially, and it is mandatory to secure the ASV systems against voice spoofing attacks such as cloned and synthetic voices. The research community has worked to develop spoofing countermeasures to improve the security of ASV systems. However, existing spoofing countermeasures have limitations to detect voice conversion, synthetic speech, and voice replay attacks. There still exists a need for the development of voice spoofing detectors to further strengthen the security of voice-controlled systems. Therefore, in this study, we designed a novel spoofing countermeasure for identifying physical access (PA) and logical access (LA) attacks. More precisely, we proposed a novel acoustic feature descriptor, namely, modified local ternary patterns (MLTP), and used them to train a BiLSTM classifier for voice spoofing detection. We assessed the performance of our approach using the standard ASVspoof-2019 dataset and attained an equal error rate (EER) of 1.87% and 2.66% for the PA and LA attacks, respectively. Experimental results obtained from a detailed assessment signify that our approach has the capability to accurately classify the bonafide and spoofed audio samples over contemporary methods.

Full Text