In-Silico Evaluation of Glucose Regulation Using Policy Gradient Reinforcement Learning for Patients with Type 1 Diabetes Mellitus

Jonas Nordhaug Myhre,Anas El Fathi,Miguel Tejedor,Ilkka Kalervo Launonen,Fred Godtliebsen

doi:10.3390/app10186350

Abstract

In this paper, we test and evaluate policy gradient reinforcement learning for automated blood glucose control in patients with Type 1 Diabetes Mellitus. Recent research has shown that reinforcement learning is a promising approach to accommodate the need for individualized blood glucose level control algorithms. The motivation for using policy gradient algorithms comes from the fact that adaptively administering insulin is an inherently continuous task. Policy gradient algorithms are known to be superior in continuous high-dimensional control tasks. Previously, most of the approaches for automated blood glucose control using reinforcement learning has used a finite set of actions. We use the Trust-Region Policy Optimization algorithm in this work. It represents the state of the art for deep policy gradient algorithms. The experiments are carried out in-silico using the Hovorka model, and stochastic behavior is modeled through simulated carbohydrate counting errors to illustrate the full potential of the framework. Furthermore, we use a model-free approach where no prior information about the patient is given to the algorithm. Our experiments show that the reinforcement learning agent is able to compete with and sometimes outperform state-of-the-art model predictive control in blood glucose regulation.

Highlights

Type 1 Diabetes Mellitus (T1DM) is a metabolic disease caused by the autoimmune destruction of insulin-producing beta cells in the pancreas [1]
Trust-region policy optimization (TRPO) is an algorithm that is based on the fact that if the policy gradient update is constrained by the total variation divergence, DTV (π1, π2 ) = max|π1 (·|s) − π2 (·|s)|, s∈S
Random skipped boluses: When it comes to the results using the extended action space TRPOe, we found that the results using 100 policy gradient iterations are inferior to the other results

Summary

Introduction

Type 1 Diabetes Mellitus (T1DM) is a metabolic disease caused by the autoimmune destruction of insulin-producing beta cells in the pancreas [1]. CSII treatment is a different strategy where the patient instead has an insulin pump that continuously infuses insulin The pump delivers both basal and bolus doses, where the basal rate consists of regularly infused short-acting insulin doses, while the boluses are activated by the user together with meal intakes and to account for hyperglycemia. With the improvement of modern treatment equipment, the combination of an insulin pump and CGM invites the addition of a third element, namely a control algorithm to substitute the operation of beta cells in the healthy pancreas. These three elements constitute the artificial pancreas [8,9]. Performance is measured through time-in-range (time spent on healthy blood glucose levels), time in hypo-/hyperglycemia, as well as blood glucose level plots for visual inspection

Related Work

Reinforcement Learning

Policy Gradient Methods

Parameterized Policies

Model Predictive Control

In-Silico Simulation

Simulator

Experiment Setup

Results

Virtual Population Experiment

Conclusions and Future Work

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Applied sciences	Publication Date: Sep 11, 2020
Citations: 9	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

In-Silico Evaluation of Glucose Regulation Using Policy Gradient Reinforcement Learning for Patients with Type 1 Diabetes Mellitus

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied sciences

Lead the way for us

Similar Papers

Influence of one-day diabetes mellitus clinic management on blood glucose control and prognosis in patients with gestational diabetes mellitus
Wei Wang ... Peng Zhang
Gynecological endocrinology : the official journal of the International Society of Gynecological Endocrinology | VOL. 38
Wei Wang, et. al.Wei Wang ... Peng Zhang
18 Feb 2022
Gynecological endocrinology : the official journal of the International Society of Gynecological Endocrinology | VOL. 38

Internet-Based Medication Management Services Improve Glycated Hemoglobin Levels in Patients with Type 2 Diabetes.
Zhiwei Lu ... Jianbo Wang
TELEMEDICINE and e-HEALTH | VOL. 27
Zhiwei Lu, et. al.Zhiwei Lu ... Jianbo Wang
09 Sep 2020
TELEMEDICINE and e-HEALTH | VOL. 27

Influences of leukocytes in patients with type 2 diabetes and periodontitis to the effects of periodontal treatment on glycemic control
P C Huo ... Dongsiqi Jin
Chinese journal of stomatology | VOL. 57
P C Huo, et. al.P C Huo ... Dongsiqi Jin
02 Jul 2022
Chinese journal of stomatology | VOL. 57

Study of mobile medical software in improving blood glucose control of patients with type 2 diabetes mellitus
...
-
, et. al. ...
20 Jan 2019
20 Jan 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

In-Silico Evaluation of Glucose Regulation Using Policy Gradient Reinforcement Learning for Patients with Type 1 Diabetes Mellitus

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Applied sciences