Differentially Private Recurrent Variational Autoencoder For Text Privacy Preservation

Yuyang Wang,Xianjia Meng,Ximeng Liu

doi:10.1007/s11036-023-02096-9

Yuyang Wang, Xianjia Meng + Show 1 more

Open Access

https://doi.org/10.1007/s11036-023-02096-9

Copy DOI

Abstract

Deep learning techniques have been widely used in natural language processing (NLP) tasks and have made remarkable progress. However, training the deep learning model relies on a large amount of data which may involve sensitive information like electronic medical records. The attacker can infer sensitive information from the model, which leads to privacy leakage. To solve this problem, we propose a Differentially Private Recurrent Variational AutoEncoder (DP-RVAE) that can generate simulated data in place of the sensitive dataset to preserve privacy. To generate high utility synthetic text, a part of sensitive text data is employed as the conditional input of the model and uses a dropout and noise perturbing mechanism to preserve differential privacy. In addition, we expand the proposed DP-RVAE to a federated learning setting and design a novel training paradigm for NLP tasks. Specifically, DP-RVAE is deployed to the client-side to train and generate personalized text. These DP-RVAE models would be aggregated and updated through the Federated Optimisation (FedOPT) algorithm so that personal information can be well preserved. We evaluate our proposed DP-RVAE through a text classification task on the Tweets depression sentiment and IMDB reviews datasets. Our DP-RVAE achieves a higher average test accuracy by 5.90% and 3.94% compared to the typical centralized training and federated learning approach, respectively. We also perform the keywords inference attack experiment on the medical description dataset collected from the real world. Compared to the typical differentially private preserving approach, the DP-RVAE decreases by 15.2% in average attack accuracy. The experimental results demonstrate that DP-RVAE can be applied to the NLP models to leverage accuracy while preserving sensitive privacy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Mobile Networks and Applications	Publication Date: Jun 14, 2023
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Differentially Private Recurrent Variational Autoencoder For Text Privacy Preservation

Abstract

Talk to us

Similar Papers

More From: Mobile Networks and Applications

Lead the way for us

Similar Papers

Deep CNN with Residual Connections and Range Normalization for Clinical Text Classification
Jonah K Kenei ... Elisha T Opiyo Omullo
Computer Science and Information Technology | VOL. 7
Jonah K Kenei, et. al.Jonah K Kenei ... Elisha T Opiyo Omullo
01 Aug 2019
Computer Science and Information Technology | VOL. 7

Learned Text Representation for Amharic Information Retrieval and Natural Language Processing
Tilahun Yeshambel ... Josiane Mothe
Information | VOL. 14
Tilahun Yeshambel, et. al.Tilahun Yeshambel ... Josiane Mothe
20 Mar 2023
Information | VOL. 14

Deep Learning Approaches for Affective Computing in Text
Ramón Zatarain Cabada ... Víctor Manuel Bátiz Beltrán
-
Ramón Zatarain Cabada, et. al.Ramón Zatarain Cabada ... Víctor Manuel Bátiz Beltrán
21 Dec 2023
21 Dec 2023

Natural Language Processing using Deep Learning in Social Media
María Teresa Giménez Fayos
-
María Teresa Giménez FayosMaría Teresa Giménez Fayos
02 Sep 2021
02 Sep 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Differentially Private Recurrent Variational Autoencoder For Text Privacy Preservation

Abstract

Talk to us

Similar Papers

More From: Mobile Networks and Applications