Speech De-identification with Deep Neural Networks

Ádám Fodor, Zoltán Ádám Milacski, László Kopácsi, András Lőrincz

doi:10.14232/actacyb.288282

Abstract

Cloud-based speech services are powerful practical tools, but sending speakers' voices over the Internet raises important privacy and legal concerns. We propose a deep neural network solution that removes personal characteristics from human speech by converting it to the voice of a Text-to-Speech (TTS) system before the utterance is sent to the cloud. The network learns to transcode sequences of vocoder parameters, together with their delta and delta-delta features, from human speech to those of the TTS engine. We evaluated several TTS systems, vocoders and audio alignment techniques. We measured the performance of our method by (i) comparing the speech recognition output on the de-identified utterances with the original texts, (ii) computing the Mel-Cepstral Distortion between the aligned TTS sequences and the transcoded sequences, and (iii) questioning human participants in A-not-B, 2AFC and 6AFC tasks. Our approach achieves the level of performance required by diverse applications.
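To make two of the quantities named in the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes the commonly used definition of Mel-Cepstral Distortion, (10 / ln 10) · sqrt(2 · Σ_d (c_d − c'_d)²) averaged over frames, and it approximates delta and delta-delta features with simple finite differences; the paper may use a different regression window or coefficient range. The function names `add_deltas` and `mel_cepstral_distortion` are hypothetical.

```python
import numpy as np


def add_deltas(features):
    """Append delta and delta-delta features to a (frames, dims) array.

    Assumption: deltas are approximated by finite differences along the
    time axis (np.gradient), not by a specific regression window.
    """
    features = np.asarray(features, dtype=float)
    delta = np.gradient(features, axis=0)
    delta_delta = np.gradient(delta, axis=0)
    return np.hstack([features, delta, delta_delta])


def mel_cepstral_distortion(reference, converted, skip_c0=True):
    """Mean Mel-Cepstral Distortion in dB between two aligned sequences.

    Both inputs are (frames, coefficients) arrays that are already
    time-aligned. Assumption: the 0th (energy) coefficient is skipped
    by default, as is common practice.
    """
    reference = np.asarray(reference, dtype=float)
    converted = np.asarray(converted, dtype=float)
    if reference.shape != converted.shape:
        raise ValueError("sequences must be aligned to the same shape")
    if skip_c0:
        reference, converted = reference[:, 1:], converted[:, 1:]
    squared_error = np.sum((reference - converted) ** 2, axis=1)
    per_frame = (10.0 / np.log(10.0)) * np.sqrt(2.0 * squared_error)
    return float(per_frame.mean())
```

The alignment step itself (the abstract mentions several audio alignment techniques) is outside this sketch; both inputs are assumed to have been aligned beforehand.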


Journal: Acta Cybernetica
