Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning Approach to Critical Care.

Ali Shirali,Alexander Schubert,Ahmed Alaa

doi:10.1109/jbhi.2024.3415115

Abstract

Medical treatments often involve a sequence of decisions, each informed by previous outcomes. This process closely aligns with reinforcement learning (RL), a framework for optimizing sequential decisions to maximize cumulative rewards under unknown dynamics. While RL shows promise for creating data-driven treatment plans, its application in medical contexts is challenging due to the frequent need to use sparse rewards, primarily defined based on mortality outcomes. This sparsity can reduce the stability of offline estimates, posing a significant hurdle in fully utilizing RL for medical decision-making. We introduce a deep Q-learning approach to obtain more reliable critical care policies by integrating relevant but noisy frequently measured biomarker signals into the reward specification without compromising the optimization of the main outcome. Our method prunes the action space based on all available rewards before training a final model on the sparse main reward. This approach minimizes potential distortions of the main objective while extracting valuable information from intermediate signals to guide learning. We evaluate our method in off-policy and offline settings using simulated environments and real health records from intensive care units. Our empirical results demonstrate that our method outperforms common offline RL methods such as conservative Q-learning and batch-constrained deep Q-learning. By disentangling sparse rewards and frequently measured reward proxies through action pruning, our work represents a step towards developing reliable policies that effectively harness the wealth of available information in data-intensive critical care environments.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning Approach to Critical Care.

Abstract

Talk to us

Similar Papers

More From: IEEE journal of biomedical and health informatics

Lead the way for us

Similar Papers

36th International Symposium on Intensive Care and Emergency Medicine : Brussels, Belgium. 15-18 March 2016.
...
Critical Care | VOL. 20
, et. al. ...
01 Apr 2016
36th International Symposium on Intensive Care and Emergency Medicine : Brussels, Belgium. 15-18 March 2016.
...

A Second Set of Eyes: An Introduction to Tele-ICU
Susan F Goran
Critical Care Nurse | VOL. 30
Susan F GoranSusan F Goran
31 Jul 2010
Critical Care Nurse | VOL. 30

Route, early or energy? \u2026 Protein improves protein balance in critically ill patients
Peter J M Weijs
Critical Care | VOL. 22
Peter J M WeijsPeter J M Weijs
14 Apr 2018
Route, early or energy? \u2026 Protein improves protein balance in critically ill patients
Peter J M Weijs

Implementing a palliative approach in the intensive care unit: an oxymoron or a realistic possibility?
Fakhri Athari ... Ken M Hillman
International Journal of Palliative Nursing | VOL. 22
Fakhri Athari, et. al.Fakhri Athari ... Ken M Hillman
02 Apr 2016
International Journal of Palliative Nursing | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning Approach to Critical Care.

Abstract

Talk to us

Similar Papers

More From: IEEE journal of biomedical and health informatics