Abstract

Machine learning plays an increasingly critical role in health science thanks to its ability to infer valuable information from high-dimensional data. More training data provides greater statistical power, yielding better models that can support decision-making in healthcare. However, this often requires combining research and patient data across institutions and hospitals, which is not always possible due to privacy considerations. In this paper, we outline a simple federated learning algorithm that implements differential privacy to protect privacy when training a machine learning model on data spread across different institutions. We tested our approach by predicting breast cancer status from gene expression data. With privacy enforced, our model achieves accuracy and precision similar to those of a single-site, non-private neural network model. This result suggests that our algorithm is an effective method of implementing differential privacy with federated learning, and clinical data scientists can use our general framework to produce differentially private models on federated datasets. Our framework is available at https://github.com/gersteinlab/idash20FL.
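For readers unfamiliar with the combination of techniques named above, the sketch below illustrates the general idea under stated assumptions: each institution trains locally with per-example gradient clipping and Gaussian noise (in the style of DP-SGD), and a central server averages the resulting weights (federated averaging). This is a minimal illustration on a toy logistic-regression model, not the paper's implementation; all function names, data, and hyperparameters here are assumptions for exposition. See the linked repository for the actual framework.

```python
# Minimal sketch: differentially private federated averaging.
# Illustrative only -- not the paper's implementation or hyperparameters.
import numpy as np

def local_dp_update(w, X, y, lr=0.1, clip=1.0, noise_mult=1.0, epochs=1, rng=None):
    """One site's DP-SGD-style update of logistic-regression weights w."""
    rng = rng or np.random.default_rng()
    for _ in range(epochs):
        preds = 1.0 / (1.0 + np.exp(-X @ w))           # sigmoid predictions
        per_ex = (preds - y)[:, None] * X               # per-example gradients
        norms = np.linalg.norm(per_ex, axis=1, keepdims=True)
        per_ex /= np.maximum(1.0, norms / clip)         # clip each gradient norm to `clip`
        grad = per_ex.sum(axis=0)
        grad += rng.normal(0.0, noise_mult * clip, size=w.shape)  # Gaussian noise for DP
        w = w - lr * grad / len(y)
    return w

def federated_round(w_global, site_data, **kw):
    """Server broadcasts weights; each site trains privately; server averages."""
    updates = [local_dp_update(w_global.copy(), X, y, **kw) for X, y in site_data]
    return np.mean(updates, axis=0)                     # federated averaging

# Toy usage: three "institutions" holding synthetic expression-like data.
rng = np.random.default_rng(0)
sites = [(rng.normal(size=(50, 20)), rng.integers(0, 2, 50)) for _ in range(3)]
w = np.zeros(20)
for _ in range(10):
    w = federated_round(w, sites, rng=rng)
```

In this design only noised model updates leave each institution; raw patient data never does, which is what makes the federated setting compatible with a differential-privacy guarantee.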
