Abstract

Voice input users’ speech recordings are collected by service providers and shared with third parties, who may abuse users’ voiceprints, identify users by voice, and learn their sensitive speech content. In this work, we design Speech Sanitizer to perturb users’ speech recordings so that the sanitized speech can be safely shared with third parties. First, we desensitize speech content by identifying sensitive words, localizing them in the audio using DTW-based keyword spotting, and substituting them with safe words. Both common and personalized sensitive words are identified and replaced. Then, we anonymize users’ voiceprints with a carefully designed voice conversion mechanism that is resistant to de-anonymization attacks. At the same time, we aim to preserve the utility of the sanitized speech, measured by the accuracy of speech recognition performed on it. We implement Speech Sanitizer and present extensive experimental results that validate the effectiveness and efficiency of our algorithms. The results demonstrate that Speech Sanitizer reduces the chance of a user’s voice being identified among 50 people by 83.7 percent while keeping the drop in speech recognition accuracy within 19.1 percent. The privacy level can also be easily relaxed to further improve speech recognition accuracy.
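
The abstract mentions DTW-based keyword spotting as the step that localizes sensitive words in the audio. As a rough illustration of that general technique only (not the paper's actual algorithm), the sketch below uses MFCC features and a simple subsequence DTW to find where a spoken keyword template best matches inside a longer recording. The file names, feature settings, and use of librosa are assumptions for the example.

```python
# Minimal sketch of DTW-based keyword spotting, assuming MFCC features and
# a plain subsequence-DTW search; this is illustrative, not the paper's code.
import numpy as np
import librosa


def mfcc_frames(path, sr=16000, n_mfcc=13):
    """Load audio and return an (n_mfcc, n_frames) MFCC feature matrix."""
    y, sr = librosa.load(path, sr=sr)
    return librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc)


def subsequence_dtw(query, target):
    """Find the target frame span that best matches the query template.

    query:  (d, N) features of the keyword template
    target: (d, M) features of the full utterance
    Returns (start_frame, end_frame, cost) of the best match.
    """
    N, M = query.shape[1], target.shape[1]
    # Frame-wise Euclidean distances, shape (N, M).
    dist = np.linalg.norm(query[:, :, None] - target[:, None, :], axis=0)

    # Accumulated cost; the match may start at any target frame, so the
    # first row carries only the local cost (no accumulation along it).
    D = np.full((N, M), np.inf)
    D[0, :] = dist[0, :]
    for i in range(1, N):
        for j in range(M):
            best_prev = D[i - 1, j]                      # vertical step
            if j > 0:
                best_prev = min(best_prev, D[i, j - 1],  # horizontal step
                                D[i - 1, j - 1])         # diagonal step
            D[i, j] = dist[i, j] + best_prev

    end = int(np.argmin(D[-1, :]))   # match may end at any target frame
    # Rough start estimate: assume the match spans about N frames; a full
    # implementation would backtrack the warping path instead.
    start = max(0, end - N + 1)
    return start, end, float(D[-1, end])


# Hypothetical usage: locate the keyword, then the matched frame span could
# be replaced with a synthesized "safe" word as described in the abstract.
keyword = mfcc_frames("keyword_template.wav")
utterance = mfcc_frames("recording.wav")
s, e, cost = subsequence_dtw(keyword, utterance)
print(f"best match: frames {s}-{e}, DTW cost {cost:.1f}")
```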
