Abstract

Background
Recent evidence suggests that guideline-directed anticoagulant therapy for atrial fibrillation (AF) remains controversial. The widely used CHA2DS2-VASc score is based solely on a limited set of traditional cardiovascular risk factors, omitting AF characteristics and other markers of thromboembolic risk. A more efficient, safer, and more personalized anticoagulant approach is warranted.

Purpose
To develop a data-driven deep reinforcement learning (DRL) model that guides dynamic anticoagulant treatment in AF patients to improve cardiovascular outcomes.

Methods
Participants were enrolled from the multicentred China Atrial Fibrillation (China-AF) Registry between August 2011 and December 2022, with regular follow-up every 6 months. Patients on warfarin at baseline were excluded because of its declining use in non-valvular AF patients in China. The DRL model was trained for optimal dynamic decision-making in a randomly selected 70% of patients and then tested in the remaining 30%. Sociodemographic characteristics, AF characteristics, medical history, lifestyle factors, laboratory examinations, and medications were used as model inputs. For each patient, the concordance rate between the DRL model's recommendations and physicians' actual non-vitamin-K-antagonist oral anticoagulant (NOAC) prescription decisions across all visits before censoring was calculated. The primary outcome was the composite of cardiovascular death; ischemic stroke, transient ischemic attack, or systemic embolism (SSE); and major bleeding. Shapley additive explanation (SHAP) analysis ranked the factors most influential on the DRL model's decision-making.

Results
A total of 20,068 patients (mean age 63.0±12.0 years; 36.2% female) were randomly divided into a training cohort of 14,050 patients and a testing cohort of 6,018 patients.
The model's NOAC recommendations were most strongly influenced by age, prior NOAC prescription, body mass index, hypertension history, and prior statin prescription (Figure 1). Compared with patients with a concordance rate of 0-25%, those with concordance rates of 50.1%-75% and 75.1%-100% had significantly lower risks of the primary outcome (adjusted HR 0.63; 95% CI, 0.46-0.85; P = 0.003 and adjusted HR 0.59; 95% CI, 0.46-0.75; P < 0.001, respectively). Similar results were observed for all-cause death, cardiovascular death, and SSE: patients with the highest concordance rate had a significantly lower risk than those with the lowest concordance rate. A similar but nonsignificant trend was observed for major bleeding events (Figure 2).

Conclusions
This modelling study suggests that a data-driven DRL model may provide more efficient, safer, and more personalized anticoagulant recommendations, potentially assisting physicians in clinical practice.
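The per-patient concordance metric described in the Methods can be sketched as follows. This is a minimal illustration only, not the authors' implementation: the function names, the boolean encoding of NOAC recommendations and prescriptions, and the intermediate 25.1%-50% band (only the 0-25%, 50.1%-75%, and 75.1%-100% bands are reported above) are assumptions.

```python
from typing import List

def concordance_rate(model_recs: List[bool], physician_rx: List[bool]) -> float:
    """Fraction of visits before censoring where the model's NOAC
    recommendation matches the physician's actual prescription decision.
    Encoding (True = NOAC recommended/prescribed) is an assumption."""
    if not model_recs or len(model_recs) != len(physician_rx):
        raise ValueError("visit histories must be non-empty and equal in length")
    agree = sum(m == p for m, p in zip(model_recs, physician_rx))
    return agree / len(model_recs)

def concordance_band(rate: float) -> str:
    """Map a rate to the abstract's reporting bands; the 25.1%-50%
    band is inferred, not stated in the abstract."""
    if rate <= 0.25:
        return "0-25%"
    if rate <= 0.50:
        return "25.1%-50%"
    if rate <= 0.75:
        return "50.1%-75%"
    return "75.1%-100%"
```

Under this sketch, a patient whose physician agreed with the model at 3 of 4 visits would have a concordance rate of 0.75 and fall in the 50.1%-75% band, the group with an adjusted HR of 0.63 for the primary outcome.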