CASA: A Compact and Scalable Accelerator for Approximate Homomorphic Encryption

Pengzhou He,Tianyou Bao,Jiafeng Xie,Samira Carolina Oliva Madrigal,Çetin Kaya Koç

doi:10.46586/tches.v2024.i2.451-480

Abstract

Approximate arithmetic-based homomorphic encryption (HE) scheme CKKS [CKKS17] is arguably the most suitable one for real-world data-privacy applications due to its wider computation range than other HE schemes such as BGV [BGV14], FV and BFV [Bra12, FV12]. However, the most crucial homomorphic operation of CKKS called key-switching induces a great amount of computational burden in actual deployment situations, and creates scalability challenges for hardware acceleration. In this paper, we present a novel Compact And Scalable Accelerator (CASA) for CKKS on the field-programmable gate array (FPGA) platform. The proposed CASA addresses the aforementioned computational and scalability challenges in homomorphic operations, including key-exchange, homomorphic multiplication, homomorphic addition, and rescaling.On the architecture layer, we propose a new design methodology for efficient acceleration of CKKS. We design this novel hardware architecture by carefully studying the homomorphic operation patterns and data dependency amongst the primitive oracles. The homomorphic operations are efficiently mapped into an accelerator with simple control and smooth operation, which brings benefits for scalable implementation and enhanced pipeline and parallel processing (even with the potential for further improvement).On the component layer, we carry out a detailed and extensive study and present novel micro-architectures for primitive function modules, including memory bank, number theoretic transform (NTT) module, modulus switching bank, and dyadic multiplication and accumulation.On the arithmetic layer, we develop a new partially reduction-free modular arithmetic technique to eliminate part of the reduction cost over different prime moduli within the moduli chain of the Residue Number System (RNS). The proposed structure can support arbitrary numbers of security primes of CKKS during key exchange, which offers better security options for adopting the scalable design methodology.As a proof-of-concept, we implement CASA on the FPGA platform and compare it with state-of-the-art designs. The implementation results showcase the superior performance of the proposed CASA in many aspects such as compact area, scalable architecture, and overall better area-time complexities.In particular, we successfully implement CASA on a mainstream resource-constrained Artix-7 FPGA. To the authors’ best knowledge, this is the first compact CKKS accelerator implemented on an Artix-7 device, e.g., CASA achieves a 10.8x speedup compared with the state-of-the-art CPU implementations (with power consumption of only 5.8%). Considering the power-delay product metric, CASA also achieves 138x and 105x improvement compared with the recent GPU implementation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

CASA: A Compact and Scalable Accelerator for Approximate Homomorphic Encryption

Abstract

Talk to us

Similar Papers

More From: IACR Transactions on Cryptographic Hardware and Embedded Systems

Lead the way for us

Journal: IACR Transactions on Cryptographic Hardware and Embedded Systems	Publication Date: Mar 12, 2024
License type: CC BY 4.0

Similar Papers

Accelerating Decision Tree Based Traffic Classification on FPGA and Multicore Platforms
Da Tong ... Yun Rock Qu
IEEE Transactions on Parallel and Distributed Systems | VOL. 28
Da Tong, et. al.Da Tong ... Yun Rock Qu
01 Nov 2017
IEEE Transactions on Parallel and Distributed Systems | VOL. 28

FPGA-Flux Proprietary System for Online Detection of Outer Race Faults in Bearings
Jonathan Cureño-Osornio ... Luis Morales-Velazquez
Electronics | VOL. 12
Jonathan Cureño-Osornio, et. al.Jonathan Cureño-Osornio ... Luis Morales-Velazquez
19 Apr 2023
Electronics | VOL. 12

Optimizing Residue Number System on FPGA
Jiahe Liu ... Bangtian Liu
-
Jiahe Liu, et. al.Jiahe Liu ... Bangtian Liu
01 Dec 2016
01 Dec 2016

FPGA Acceleration of 3GPP Channel Model Emulator for 5G New Radio
Nasir Ali Shah ... Luciano Lavagno
IEEE Access | VOL. 10
Nasir Ali Shah, et. al.Nasir Ali Shah ... Luciano Lavagno
01 Jan 2021
IEEE Access | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

CASA: A Compact and Scalable Accelerator for Approximate Homomorphic Encryption

Abstract

Talk to us

Similar Papers

More From: IACR Transactions on Cryptographic Hardware and Embedded Systems