Abstract
Automatic speaker verification (ASV) systems have been used in numerous voice-controlled technologies including voice banking, automobiles, smartphones, smart homes, etc. The growth of voice-operated devices is increasing exponentially, and it is mandatory to secure the ASV systems against voice spoofing attacks such as cloned and synthetic voices. The research community has worked to develop spoofing countermeasures to improve the security of ASV systems. However, existing spoofing countermeasures have limitations to detect voice conversion, synthetic speech, and voice replay attacks. There still exists a need for the development of voice spoofing detectors to further strengthen the security of voice-controlled systems. Therefore, in this study, we designed a novel spoofing countermeasure for identifying physical access (PA) and logical access (LA) attacks. More precisely, we proposed a novel acoustic feature descriptor, namely, modified local ternary patterns (MLTP), and used them to train a BiLSTM classifier for voice spoofing detection. We assessed the performance of our approach using the standard ASVspoof-2019 dataset and attained an equal error rate (EER) of 1.87% and 2.66% for the PA and LA attacks, respectively. Experimental results obtained from a detailed assessment signify that our approach has the capability to accurately classify the bonafide and spoofed audio samples over contemporary methods.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have