Web-Based Privacy-Preserving Multicenter Medical Data Analysis Tools Via Threshold Homomorphic Encryption: Design and Development Study.

Yao Lu,Tianshu Zhou,Yu Tian,Shiqiang Zhu,Jingsong Li

doi:10.2196/22555

Yao Lu, Tianshu Zhou + Show 3 more

Open Access

https://doi.org/10.2196/22555

Copy DOI

Abstract

BackgroundData sharing in multicenter medical research can improve the generalizability of research, accelerate progress, enhance collaborations among institutions, and lead to new discoveries from data pooled from multiple sources. Despite these benefits, many medical institutions are unwilling to share their data, as sharing may cause sensitive information to be leaked to researchers, other institutions, and unauthorized users. Great progress has been made in the development of secure machine learning frameworks based on homomorphic encryption in recent years; however, nearly all such frameworks use a single secret key and lack a description of how to securely evaluate the trained model, which makes them impractical for multicenter medical applications.ObjectiveThe aim of this study is to provide a privacy-preserving machine learning protocol for multiple data providers and researchers (eg, logistic regression). This protocol allows researchers to train models and then evaluate them on medical data from multiple sources while providing privacy protection for both the sensitive data and the learned model.MethodsWe adapted a novel threshold homomorphic encryption scheme to guarantee privacy requirements. We devised new relinearization key generation techniques for greater scalability and multiplicative depth and new model training strategies for simultaneously training multiple models through x-fold cross-validation.ResultsUsing a client-server architecture, we evaluated the performance of our protocol. The experimental results demonstrated that, with 10-fold cross-validation, our privacy-preserving logistic regression model training and evaluation over 10 attributes in a data set of 49,152 samples took approximately 7 minutes and 20 minutes, respectively.ConclusionsWe present the first privacy-preserving multiparty logistic regression model training and evaluation protocol based on threshold homomorphic encryption. Our protocol is practical for real-world use and may promote multicenter medical research to some extent.

Highlights

BackgroundIn recent years, researchers have proposed strong requirements for the quality of medical research as it continues to progress, which has promoted the development of multicenter research
The least squares approximation function is integerized to be compatible with the homomorphic encryption computation: The integerized function output is transformed into an original function: We describe the detailed process of secure logistic regression
We propose the first privacy-preserving multiparty logistic regression model training and evaluation protocol based on threshold homomorphic encryption

Summary

Introduction

BackgroundIn recent years, researchers have proposed strong requirements for the quality of medical research as it continues to progress, which has promoted the development of multicenter research. Many medical institutions are unwilling to share their data despite the aforementioned benefits, which hinders the collaborative benefits of multicenter research To solve this problem, a framework is urgently needed to support multicenter medical research efficiently while preventing the leakage of sensitive information. Data sharing in multicenter medical research can improve the generalizability of research, accelerate progress, enhance collaborations among institutions, and lead to new discoveries from data pooled from multiple sources. Despite these benefits, many medical institutions are unwilling to share their data, as sharing may cause sensitive information to be leaked to researchers, other institutions, and unauthorized users. Our protocol is practical for real-world use and may promote multicenter medical research to some extent

Objectives

Methods

Results

Discussion

Conclusion

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Medical Internet Research	Publication Date: Dec 8, 2020
Citations: 8	License type: cc-by

R Discovery Prime

R Discovery Prime

Web-Based Privacy-Preserving Multicenter Medical Data Analysis Tools Via Threshold Homomorphic Encryption: Design and Development Study.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Medical Internet Research

Lead the way for us

Similar Papers

High performance of privacy-preserving acute myocardial infarction auxiliary diagnosis based on federated learning: a multicenter retrospective study.
Jie Xu ... Huamin Yu
Annals of Translational Medicine | VOL. 10
Jie Xu, et. al.Jie Xu ... Huamin Yu
01 Sep 2022
Annals of Translational Medicine | VOL. 10

Computing Blindfolded on Data Homomorphically Encrypted under Multiple Keys: A Survey
Asma Aloufi ... Peizhao Hu
ACM Computing Surveys | VOL. 54
Asma Aloufi, et. al.Asma Aloufi ... Peizhao Hu
08 Oct 2021
ACM Computing Surveys | VOL. 54

Multicenter Privacy-Preserving Cox Analysis Based on Homomorphic Encryption.
Yao Lu ... Yu Tian
IEEE Transactions on Information Technology in Biomedicine | VOL. 25
Yao Lu, et. al.Yao Lu ... Yu Tian
06 Apr 2021
IEEE Transactions on Information Technology in Biomedicine | VOL. 25

NN-EMD: Efficiently Training Neural Networks Using Encrypted Multi-Sourced Datasets
Runhua Xu ... James Joshi
IEEE Transactions on Dependable and Secure Computing | VOL. 19
Runhua Xu, et. al.Runhua Xu ... James Joshi
02 Apr 2021
IEEE Transactions on Dependable and Secure Computing | VOL. 19

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Web-Based Privacy-Preserving Multicenter Medical Data Analysis Tools Via Threshold Homomorphic Encryption: Design and Development Study.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Journal of Medical Internet Research