A Privacy-Preserving Reinforcement Learning Approach for Dynamic Treatment Regimes on Health Data

Xiaoqiang Sun,Ting Wang,Jiakai Wei,Zhiwei Sun,Jie Feng,Guangwu Hu,Celimuge Wu

doi:10.1155/2021/8952219

Abstract

Based on the clinical states of the patient, dynamic treatment regime technology can provide various therapeutic methods, which is helpful for medical treatment policymaking. Reinforcement learning is an important approach for developing this technology. In order to implement the reinforcement learning algorithm efficiently, the computation of health data is usually outsourced to the untrustworthy cloud server. However, it may leak, falsify, or delete private health data. Encryption is a common method for solving this problem. But the cloud server is difficult to calculate encrypted health data. In this paper, based on Cheon et al.’s approximate homomorphic encryption scheme, we first propose secure computation protocols for implementing comparison, maximum, exponentiation, and division. Next, we design a homomorphic reciprocal of square root protocol firstly, which only needs one approximate computation. Based on the proposed secure computation protocols, we design a secure asynchronous advantage actor-critic reinforcement learning algorithm for the first time. Then, it is used to implement a secure treatment decision-making algorithm. Simulation results show that our secure computation protocols and algorithms are feasible.

Highlights

As a recent healthcare tendency, personalized medicine [1] enables the patient to obtain early diagnoses, risk estimation, optimal treatments with low costs by using molecular and cellular analysis technologies, diagnosis results, genetic information, etc
Personalized medicine is usually implemented by the dynamic treatment regime technology [2, 3], which can provide various therapeutic methods according to the time-varying clinical states of the patient
In order to acquire the ciphertexts of historical data owners and wearable devices, middle ciphertext results during the execution of privacy-preserving A3C reinforcement learning algorithm and treatment decision-making algorithm (Section 6), A∗1 eavesdrops on the communication links among the entities in the system model

Summary

Introduction

As a recent healthcare tendency, personalized medicine [1] enables the patient to obtain early diagnoses, risk estimation, optimal treatments with low costs by using molecular and cellular analysis technologies, diagnosis results, genetic information, etc. Because of the patient’s limited computation ability, health data are usually outsourced to the cloud server for implementing the reinforcement learning algorithm. Because the cloud server may be untrusted, it is likely that health data will be illegally accessed, forged, tampered, or discarded in the process of transmission and computation It may be harmful for personal privacy, economic interests, and even the security of human life. The cloud server can run the reinforcement learning algorithm on the encrypted health data perfectly by using homomorphic encryption without leaking patient privacy. We first design the homomorphic reciprocal of square root protocol, which needs only one approximate computation (2) Based on the proposed secure computation protocols, we design the secure A3C reinforcement learning algorithm for the first time We use it to implement a secure treatment decision-making algorithm (3) we simulate the proposed secure computation protocols and algorithms on the personal computer’s virtual machine.

Related Work

Preliminaries

Asynchronous Advantage Actor-Critic Reinforcement

Secure Dynamic Treatment Regimes on Health Data

Building Blocks

Privacy-Preserving Computation Algorithms

Performance Results

Conclusion

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Wireless Communications and Mobile Computing	Publication Date: Nov 23, 2021
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A Privacy-Preserving Reinforcement Learning Approach for Dynamic Treatment Regimes on Health Data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Wireless Communications and Mobile Computing

Lead the way for us

Similar Papers

A new asynchronous reinforcement learning algorithm based on improved parallel PSO
Shifei Ding ... Weikuan Jia
Applied intelligence (Dordrecht, Netherlands) | VOL. 49
Shifei Ding, et. al.Shifei Ding ... Weikuan Jia
25 May 2019
Applied intelligence (Dordrecht, Netherlands) | VOL. 49

Applications of asynchronous deep reinforcement learning based on dynamic updating weights
Xingyu Zhao ... Yuexuan An
Applied intelligence (Dordrecht, Netherlands) | VOL. 49
Xingyu Zhao, et. al.Xingyu Zhao ... Yuexuan An
07 Sep 2018
Applied intelligence (Dordrecht, Netherlands) | VOL. 49

A New Asynchronous Architecture for Tabular Reinforcement Learning Algorithms
Xingyu Zhao ... Shifei Ding
-
Xingyu Zhao, et. al.Xingyu Zhao ... Shifei Ding
17 Oct 2018
17 Oct 2018

Asynchronous reinforcement learning algorithms for solving discrete space path planning problems
Xingyu Zhao ... Weikuan Jia
Applied intelligence (Dordrecht, Netherlands) | VOL. 48
Xingyu Zhao, et. al.Xingyu Zhao ... Weikuan Jia
04 Aug 2018
Applied intelligence (Dordrecht, Netherlands) | VOL. 48

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Privacy-Preserving Reinforcement Learning Approach for Dynamic Treatment Regimes on Health Data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Wireless Communications and Mobile Computing