Reinforcement Learning Problem Research Articles

Ultrasound image acquisition in conventional transesophageal echocardiography (TEE) requires complex manual operation of the probe in the esophagus based on the interpretation of ultrasound images and in-depth knowledge of the cardiac anatomy. In this work, we formulate the TEE probe guidance task as a reinforcement learning (RL) problem, and present the first learning-based solution to 3-DOF control of a TEE probe based on the ultrasound image feedback, named RL-TEE, in order to mimic the visual search and navigation strategies of expert echocardiographers. The probe-tissue interaction in TEE is carefully modeled in our framework by considering both the requirements for navigation towards the standard views and compliance in the esophageal environment. Furthermore, we propose a hybrid deep Q-network model that augments a convolutional neural network backbone with self-attention mechanisms to better capture spatial information in ultrasound images to guide navigation decisions. The presented methods are preliminarily validated in a TEE simulation environment built with data from 25 subjects to acquire four standard views of the heart. Our results show that the proposed method can effectively learn to accurately and compliantly guide the probe movement for TEE standard view acquisition tasks and has a good generalization ability to unseen patient data. <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">Note to Practitioners</i> —The motivation of this paper is to realize 3-DOF movement guidance of a TEE probe to acquire the standard views of the heart based on the real-time images, which can be applied to existing robotic control systems or used to assist novice echocardiographers in TEE examination, thereby relieving operator workload and improving ease of use. This paper suggests a novel approach that uses the deep RL technique to achieve automatic interpretation of TEE images and intelligent guidance of the probe movement. The RL framework is designed to take into account both the navigation efficiency and compliance with the esophageal environment for the targeted intracorporeal application. A hybrid deep Q-network model that augments a convolutional neural network with attention mechanisms is designed to better capture spatial information from ultrasound images to predict the probe movement. The effectiveness of the framework is preliminarily validated in extensive experiments in a simulation environment built with real patient data. The proposed method can be applied in clinical use to provide real-time TEE probe guidance for novice echocardiographers, and can be integrated with a robotic system to fully automate the TEE acquisition, thereby relieving the doctors from tedious manual operation to focus on the diagnosis and treatment.

Abstract Space layout design is a critical aspect of architectural design, influencing functionality and aesthetics. The inherent combinatorial nature of layout design poses challenges for traditional planning approaches; thus, it demands the exploration of novel methods. This paper presents a novel framework that leverages the potential of deep reinforcement learning (RL) algorithms to optimize space layouts. RL has demonstrated remarkable success in addressing complex decision-making problems, yet its application in the design process remains relatively unexplored. We argue that RL is particularly well-suited for the design process due to its ability to accommodate offline tasks and seamless integration with existing computer-aided design software, effectively acting as a simulator for design exploration. Framing space layout design as an RL problem and employing RL methods allows for the automated exploration of the expansive design space, thereby enhancing the discovery of innovative solutions. This paper also elucidates the synergy between the design process and the RL problem, which opens new avenues for exploring the potential of RL algorithms in design. We aim to foster experimentation and collaboration within the RL and architecture communities. To facilitate our research, we have developed SpaceLayoutGym, an environment specifically designed for space layout design tasks. SpaceLayoutGym serves as a customizable environment that encapsulates the essential elements of the layout design process within an RL framework. To showcase the effectiveness of SpaceLayoutGym and the capabilities of RL as an artificial space layout designer, we employ the Proximal Policy Optimization (PPO) algorithm to train the RL agent in selected design scenarios with both geometrical constraints and topological objectives. The study further extends to contrast the effectiveness of PPO agents with that of genetic algorithms, and also includes a comparative analysis with existing layouts. Our results demonstrate the potential of RL to optimize space layouts, offering a promising direction for the future of artificial intelligence-aided design.

Reinforcement Learning Problem Research Articles

Related Topics

Articles published on Reinforcement Learning Problem

Guiding real-world reinforcement learning for in-contact manipulation tasks with Shared Control Templates

Multi-fidelity reinforcement learning with control variates

Attention-Based Intrinsic Reward Mixing Network for Credit Assignment in Multiagent Reinforcement Learning

Joint learning of reward machines and policies in environments with partially known semantics

Combining Reinforcement Learning and Tensor Networks, with an Application to Dynamical Large Deviations.

Teach and Explore: A Multiplex Information-guided Effective and Efficient Reinforcement Learning for Sequential Recommendation

Deep Reinforcement Learning for Mobile Robot Path Planning

Deep reinforcement learning sensor scheduling for effective monitoring of dynamical systems

Graph Representations for Reinforcement Learning

Intent-based AI system in packet-optical networks towards 6G [Invited

RL-TEE: Autonomous Probe Guidance for Transesophageal Echocardiography Based on Attention-Augmented Deep Reinforcement Learning

A Review on NEAT and Other Reinforcement Algorithms in Robotics

Reimagining space layout design through deep reinforcement learning

I Open at the Close: A Deep Reinforcement Learning Evaluation of Open Streets Initiatives

Learning to Stop Cut Generation for Efficient Mixed-Integer Linear Programming

Harnessing Network Effect for Fake News Mitigation: Selecting Debunkers via Self-Imitation Learning

ConsistentEE: A Consistent and Hardness-Guided Early Exiting Method for Accelerating Language Models Inference

Parameterized Projected Bellman Operator

Offline Model-Based Optimization via Policy-Guided Gradient Search

Imitate the Good and Avoid the Bad: An Incremental Approach to Safe Reinforcement Learning

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Reinforcement Learning Problem Research Articles

Related Topics

Articles published on Reinforcement Learning Problem

Guiding real-world reinforcement learning for in-contact manipulation tasks with Shared Control Templates

Multi-fidelity reinforcement learning with control variates

Attention-Based Intrinsic Reward Mixing Network for Credit Assignment in Multiagent Reinforcement Learning

Joint learning of reward machines and policies in environments with partially known semantics

Combining Reinforcement Learning and Tensor Networks, with an Application to Dynamical Large Deviations.

Teach and Explore: A Multiplex Information-guided Effective and Efficient Reinforcement Learning for Sequential Recommendation

Deep Reinforcement Learning for Mobile Robot Path Planning

Deep reinforcement learning sensor scheduling for effective monitoring of dynamical systems

Graph Representations for Reinforcement Learning

Intent-based AI system in packet-optical networks towards 6G [Invited

RL-TEE: Autonomous Probe Guidance for Transesophageal Echocardiography Based on Attention-Augmented Deep Reinforcement Learning

A Review on NEAT and Other Reinforcement Algorithms in Robotics

Reimagining space layout design through deep reinforcement learning

I Open at the Close: A Deep Reinforcement Learning Evaluation of Open Streets Initiatives

Learning to Stop Cut Generation for Efficient Mixed-Integer Linear Programming

Harnessing Network Effect for Fake News Mitigation: Selecting Debunkers via Self-Imitation Learning

ConsistentEE: A Consistent and Hardness-Guided Early Exiting Method for Accelerating Language Models Inference

Parameterized Projected Bellman Operator

Offline Model-Based Optimization via Policy-Guided Gradient Search

Imitate the Good and Avoid the Bad: An Incremental Approach to Safe Reinforcement Learning