Abstract

Genetic matching in personalized medicine is becoming more popular in cloud computing, whereby a cloud server performs genetic matching from the genetic data outsourced by a gene provider (e.g., a genetic lab) and an authorized party (e.g., a doctor) for diagnosing the patients’ diseases. Due to sensitive privacy, we should protect genetic data before outsourcing it to the untrusted cloud. However, traditional differential privacy schemes do not support genetic matching and ciphertext methods hinder data availability. In this article, we propose a differential privacy-based genetic matching (DPGM) scheme to achieve effective genetic matching and protect genetic privacy. Specifically, DPGM first uses a DP-based EIGENSTRAT (DPE) algorithm to construct a published sequence that contains significantly noisy single-nucleotide polymorphisms (SNPs) associated with diseases, thereby ensuring outsourced genetic data privacy. Second, DPGM adopts a DP-based N-order Markov (DPNM) algorithm to generate a noisy query sequence, which considers query privacy and the similarity between the noisy query and the actual query. Finally, DPGM calculates the longest common subsequence (LCS) based on a dynamic programming algorithm, which achieves effective matching results. Detailed theoretical analysis proves that our DPGM scheme achieves <inline-formula><tex-math notation="LaTeX">$\epsilon$</tex-math></inline-formula> -differential privacy. Extensive experiments over actual genetic datasets demonstrate that our scheme achieves high efficiency and data utility.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call