Soft Actor-critic Research Articles

Abstract The autonomous navigation and obstacle avoidance capabilities of autonomous underwater vehicles (AUVs) are essential for ensuring their safe navigation and long-term, efficient operation. However, the complexity of the marine environment poses significant challenges to safe and effective obstacle avoidance. To address this issue, this study proposes an AUV obstacle avoidance control algorithm based on offline reinforcement learning. This method adopts the Conservative Q-learning (CQL) algorithm, which is based on the Soft Actor-Critic (SAC) framework. It learns from obtained historical obstacle avoidance data and ultimately achieves a favorable obstacle avoidance control strategy. In this method, PID and SAC control algorithms are utilized to generate expert obstacle avoidance data to construct a diversified offline database. Additionally, based on the line-of-sight (LOS) guidance method and artificial potential field (APF) method, information regarding the distance and orientation of targets and obstacles is incorporated into the state space, and heading and obstacle avoidance reward terms are integrated into the reward function design. The algorithm successfully guides the AUV in autonomous navigation and dynamic obstacle avoidance in three-dimensional space. Furthermore, the algorithm exhibits a certain degree of anti-interference capability against uncertain disturbances and ocean currents, enhancing the safety and robustness of the AUV system. Simulation results fully demonstrate the feasibility and effectiveness of the intelligent obstacle avoidance method based on offline reinforcement learning. This study highlights the profound significance of offline reinforcement learning in enabling robust and reliable control systems for AUVs, paving the way for enhanced operational capabilities in challenging marine environments.

Read full abstract

Identifying the time-varying control schemes that maximize storage performance is critical to the commercial deployment of geological carbon storage (GCS) projects. However, the optimization process typically demands extensive resource-intensive simulation evaluations, which poses significant computational challenges and practical limitations. In this study, we presented the multimodal latent dynamic (MLD) model, a novel deep learning framework for fast flow prediction and well control optimization in GCS operations. The MLD model implicitly characterizes the forward compositional simulation process through three components: a representation module that learns compressed latent representations of the system, a transition module that approximates the evolution of the system states in the low-dimensional latent space, and a prediction module that forecasts the flow responses for given well controls. A novel model training strategy combining a regression loss and a joint-embedding consistency loss was introduced to jointly optimize the three modules, which enhances the temporal consistency of the learned representations and ensures multi-step prediction accuracy. Unlike most existing deep learning models designed for systems with specific parameters, the MLD model supports arbitrary input modalities, thereby enabling comprehensive consideration of interactions between diverse types of data, including dynamic state variables, static spatial system parameters, rock and fluid properties, as well as external well settings. Since the MLD model mirrors the structure of a Markov decision process (MDP) that computes state transitions and rewards (i.e., economic calculation for flow responses) for given states and actions, it can serve as an interactive environment to train deep reinforcement learning agents. Specifically, the soft actor-critic (SAC) algorithm was employed to learn an optimal control policy that maximizes the net present value (NPV) from the experiences gained by continuous interactions with the MLD model. The efficacy of the proposed approach was first compared against commonly used simulation-based evolutionary algorithm and surrogate-assisted evolutionary algorithm on a deterministic GCS optimization case, showing that the proposed approach achieves the highest NPV, while reducing the required computational resources by more than 60%. The framework was further applied to the generalizable GCS optimization case. The results indicate that the trained agent is capable of harnessing the knowledge learned from previous scenarios to provide improved decisions for newly encountered scenarios, demonstrating promising generalization performance.

Read full abstract

Soft Actor-critic Research Articles

Related Topics

Articles published on Soft Actor-critic

Stabilization of Phasor Measurement Sensor-Based Markovian Jump CPSs Through Soft Actor–Critic

Low-Carbon Dispatch Method for Active Distribution Network Based on Carbon Emission Flow Theory

Evade Unknown Pursuer via Pursuit Strategy Identification and Model Reference Policy Adaptation (MRPA) Algorithm

Soft Actor-Critic Approach to Self-Adaptive Particle Swarm Optimisation

A deep reinforcement learning based charging and discharging scheduling strategy for electric vehicles

Optimization of emergency frequency control strategy for power systems considering both source and load uncertainties

A smart home energy management system based on human activity recognition and deep reinforcement learning

Research on obstacle avoidance of underactuated autonomous underwater vehicle based on offline reinforcement learning

Empowering legal justice with AI: A reinforcement learning SAC-VAE framework for advanced legal text summarization.

Research on robust decision making for intelligent connected vehicle at highway on-ramp

Deep Reinforcement Learning for Autonomous Driving Systems

A path planning method based on deep reinforcement learning for AUV in complex marine environment

Optimization of geological carbon storage operations with multimodal latent dynamic model and deep reinforcement learning

Simultaneous Optimization of Discrete and Continuous Parameters Defining a Robot Morphology and Controller.

Reinforcement learning for heliostat aiming: Improving the performance of Solar Tower plants

Enhancing the Minimum Awareness Failure Distance in V2X Communications: A Deep Reinforcement Learning Approach.

Advancements in UAV Path Planning: A Deep Reinforcement Learning Approach with Soft Actor-Critic for Enhanced Navigation

Sea-Based UAV Network Resource Allocation Method Based on an Attention Mechanism

Achieving Robust Learning Outcomes in Autonomous Driving with DynamicNoise Integration in Deep Reinforcement Learning

Smart energy management for hybrid electric bus via improved soft actor-critic algorithm in a heuristic learning framework

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Soft Actor-critic Research Articles

Related Topics

Articles published on Soft Actor-critic

Stabilization of Phasor Measurement Sensor-Based Markovian Jump CPSs Through Soft Actor–Critic

Low-Carbon Dispatch Method for Active Distribution Network Based on Carbon Emission Flow Theory

Evade Unknown Pursuer via Pursuit Strategy Identification and Model Reference Policy Adaptation (MRPA) Algorithm

Soft Actor-Critic Approach to Self-Adaptive Particle Swarm Optimisation

A deep reinforcement learning based charging and discharging scheduling strategy for electric vehicles

Optimization of emergency frequency control strategy for power systems considering both source and load uncertainties

A smart home energy management system based on human activity recognition and deep reinforcement learning

Research on obstacle avoidance of underactuated autonomous underwater vehicle based on offline reinforcement learning

Empowering legal justice with AI: A reinforcement learning SAC-VAE framework for advanced legal text summarization.

Research on robust decision making for intelligent connected vehicle at highway on-ramp

Deep Reinforcement Learning for Autonomous Driving Systems

A path planning method based on deep reinforcement learning for AUV in complex marine environment

Optimization of geological carbon storage operations with multimodal latent dynamic model and deep reinforcement learning

Simultaneous Optimization of Discrete and Continuous Parameters Defining a Robot Morphology and Controller.

Reinforcement learning for heliostat aiming: Improving the performance of Solar Tower plants

Enhancing the Minimum Awareness Failure Distance in V2X Communications: A Deep Reinforcement Learning Approach.

Advancements in UAV Path Planning: A Deep Reinforcement Learning Approach with Soft Actor-Critic for Enhanced Navigation

Sea-Based UAV Network Resource Allocation Method Based on an Attention Mechanism

Achieving Robust Learning Outcomes in Autonomous Driving with DynamicNoise Integration in Deep Reinforcement Learning

Smart energy management for hybrid electric bus via improved soft actor-critic algorithm in a heuristic learning framework