The Proposal of Countermeasures for DeepFake Voices on Social Media Considering Waveform and Text Embedding

Yuta Yanagi,Ryohei Orihara,Tanel Alumäe,Yuichi Sei,Yasuyuki Tahara,Akihiko Ohsuga

doi:10.33166/aetic.2024.02.002

Abstract

In recent times, advancements in text-to-speech technologies have yielded more natural-sounding voices. However, this has also made it easier to generate malicious fake voices and disseminate false narratives. ASVspoof stands out as a prominent benchmark in the ongoing effort to automatically detect fake voices, thereby playing a crucial role in countering illicit access to biometric systems. Consequently, there is a growing need to broaden our perspectives, particularly when it comes to detecting fake voices on social media platforms. Moreover, existing detection models commonly face challenges related to their generalization performance. This study sheds light on specific instances involving the latest speech generation models. Furthermore, we introduce a novel framework designed to address the nuances of detecting fake voices in the context of social media. This framework considers not only the voice waveform but also the speech content. Our experiments have demonstrated that the proposed framework considerably enhances classification performance, as evidenced by the reduction in equal error rate. This underscores the importance of considering the waveform and the content of the voice when tasked with identifying fake voices and disseminating false claims.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Annals of Emerging Technologies in Computing	Publication Date: Apr 1, 2024
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

The Proposal of Countermeasures for DeepFake Voices on Social Media Considering Waveform and Text Embedding

Abstract

Talk to us

Similar Papers

More From: Annals of Emerging Technologies in Computing

Lead the way for us

Similar Papers

Word confidence calibration using a maximum entropy model with constraints on confidence and word distributions
Dong Yu ... Jinyu Li
-
Dong Yu, et. al.Dong Yu ... Jinyu Li
01 Jan 2009
01 Jan 2009

Increasing the Robustness of i-vectors with Model Compensated First Order Statistics
Gökay Di̇şken ... Zekeriya Tüfekci̇
Afyon Kocatepe University Journal of Sciences and Engineering | VOL. 23
Gökay Di̇şken, et. al.Gökay Di̇şken ... Zekeriya Tüfekci̇
01 Mar 2023
Afyon Kocatepe University Journal of Sciences and Engineering | VOL. 23

Going Viral: The 3 Rs of Social Media Messaging during Public Health Emergencies.
Bhavini Patel Murthy ... Tanya Telfair Leblanc
Health security | VOL. 19
Bhavini Patel Murthy, et. al.Bhavini Patel Murthy ... Tanya Telfair Leblanc
01 Feb 2021
Health security | VOL. 19

Emotion attribute projection for speaker recognition on emotional speech
Huanjun Bao ... Thomas Fang Zheng
-
Huanjun Bao, et. al.Huanjun Bao ... Thomas Fang Zheng
27 Aug 2007
27 Aug 2007

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The Proposal of Countermeasures for DeepFake Voices on Social Media Considering Waveform and Text Embedding

Abstract

Talk to us

Similar Papers

More From: Annals of Emerging Technologies in Computing