MERGE: Fast Private Text Generation

Zi Liang,Haitao Bai,Shuo Zhang,Lifeng Xing,Pinghui Wang,Ziyang Zhou,Nuo Xu,Ruofei Zhang

doi:10.1609/aaai.v38i18.29964

Abstract

The drastic increase in language models' parameters has led to a new trend of deploying models in cloud servers, raising growing concerns about private inference for Transformer-based models. Existing two-party privacy-preserving techniques, however, only take into account natural language understanding (NLU) scenarios. Private inference in natural language generation (NLG), crucial for applications like translation and code completion, remains underexplored. In addition, previous privacy-preserving techniques suffer from convergence issues during model training and exhibit poor inference speed when used with NLG models due to the neglect of time-consuming operations in auto-regressive generations. To address these issues, we propose a fast private text generation framework for Transformer-based language models, namely MERGE. MERGE reuses the output hidden state as the word embedding to bypass the embedding computation and reorganize the linear operations in the Transformer module to accelerate the forward procedure. Extensive experiments show that MERGE achieves a 26.5x speedup to the vanilla encrypted model under the sequence length 512, and reduces 80% communication cost, with an up to 10x speedup to state-of-the-art approximated models.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

MERGE: Fast Private Text Generation

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Similar Papers

Усвоение языка у языковых моделей и человека: хронологическое пробинг-исследование
Ekaterina Voloshina ... Oleg Serikov
-
Ekaterina Voloshina, et. al.Ekaterina Voloshina ... Oleg Serikov
18 Jun 2022
18 Jun 2022

Comparative Study of Multiclass Text Classification in Research Proposals Using Pretrained Language Models
Eunchan Lee ... Sangtae Ahn
Applied Sciences | VOL. 12
Eunchan Lee, et. al.Eunchan Lee ... Sangtae Ahn
29 Apr 2022
Applied Sciences | VOL. 12

Transformer-based Natural Language Understanding and Generation
Feng Zhang ... Gaoyun An
-
Feng Zhang, et. al.Feng Zhang ... Gaoyun An
21 Oct 2022
21 Oct 2022

NLP and its Components: A Detailed Discussion
Zohaib Hasan ... Nidhi Pateriya
International Journal of Innovative Research in Computer and Communication Engineering | VOL. 11
Zohaib Hasan, et. al.Zohaib Hasan ... Nidhi Pateriya
25 Nov 2023
International Journal of Innovative Research in Computer and Communication Engineering | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

MERGE: Fast Private Text Generation

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence