Deep Deterministic Policy Gradient Algorithm Research Articles

With the increasing global attention on energy efficiency and carbon emissions, the optimization of integrated energy systems (IES) has become the key to improve energy efficiency and reduce pollution emissions. However, most of the existing optimization methods cannot effectively deal with the complexity of high dimensional continuous action space. Therefore, this paper focuses on a novel multi-objective optimization strategy for the electricity–gas–heat integrated energy systems (EGH-IES). Firstly, considering the absorption capacity of wind power and the emission of pollutant gases, a multi-objective optimization model is constructed based on the mechanism model and operation constraints of each device in EGH-IES, in which the integrated operation cost and the environmental factors are taken as optimization objectives. Then, the multi-objective optimization problem is designed as the optimal strategy of interaction learning between agent and environment in reinforcement learning, and the output power of the devices constitutes the action of reinforcement learning. Additionally, the Ornstein–Uhlenbeck process is introduced to enhance the training efficiency and exploration performance of the agent, and the deep deterministic policy gradients (DDPG) algorithm is employed to optimize the action, thus the output power of the appliances could be obtained. Finally, the simulation results show that compared with deep Q network (DQN) method and proximal policy optimization (PPO) method, the reward function value of the proposed method increases by 2.43% and 6.09%, respectively, which represents a reduction in economic cost and pollutant emissions. These verify the effectiveness and superiority of the proposed multi-objective optimization scheme in cost reduction and benefit improvement for the EGH-IES.

Read full abstract

In the context of the global energy transition, optimizing deep-water oil and gas drilling parameters is crucial for ensuring safety while improving efficiency. Traditional methods face limitations in highly dynamic and nonlinear drilling environments, struggling to balance speed and cost-effectiveness. Furthermore, these methods rely on real-time logging-while-drilling (LWD) data for decision-making, but delays in data collection and processing hinder timely adjustments of drilling parameters, affecting decision accuracy and responsiveness. This paper proposes a multi-objective drilling parameter optimization framework, incorporating symbolic regression, time-series networks, and Markov decision processes to precisely predict ROP, formation conditions, and optimize drilling parameters in real time. Key innovations include a multi-population evolutionary symbolic regression algorithm for constructing empirical equations, the integration of variational mode decomposition (VMD) and sample entropy for data preprocessing, and multi-head self-attention time-series networks to enhance prediction accuracy. Quantile regression further estimates the range of drilling parameter adjustments. Additionally, a drilling parameter optimization deep deterministic policy gradient (DPODDPG) algorithm was developed to automate real-time parameter adjustments. Empirical analysis on the Ledong 10-1 block in the South China Sea demonstrated significant improvements: ROP increased from 54.18 m/hr to 122.17 m/hr, mechanical specific energy (MSE) decreased from 100.82 MPa to 97.78 MPa, and cost per foot reduced from 121.16 × 102 CNY/m to 51.31 × 102 CNY/m. Compared to traditional methods, the proposed framework showed clear advantages in enhancing ROP, reducing MSE, and controlling costs, further validating its superiority in complex drilling environments. This method not only significantly improves drilling efficiency and economic benefits but also adapts to complex and changing drilling conditions, showing broad application potential, particularly in challenging deep-water oil and gas drilling operations, where it can provide more efficient and reliable optimization solutions.

Read full abstract

Deep Deterministic Policy Gradient Algorithm Research Articles

Related Topics

Articles published on Deep Deterministic Policy Gradient Algorithm

A collaborative-learning multi-agent reinforcement learning method for distributed hybrid flow shop scheduling problem

A novel method for solving dynamic flexible job-shop scheduling problem via DIFFormer and deep reinforcement learning

Improved DDPG algorithm-based path planning for unmanned surface vehicles

Decision-Making Policy for Autonomous Vehicles on Highways Using Deep Reinforcement Learning (DRL) Method

TD3-based trajectory optimization for energy consumption minimization in UAV-assisted MEC system

Human-in-the-Loop Reinforcement Learning in Continuous-Action Space.

Peer-to-Peer Energy Transactions for Prosumers Based on Improved Deep Deterministic Policy Gradient Algorithm

The docking control system of an autonomous underwater vehicle combining intelligent object recognition and deep reinforcement learning

Quadrotor Trajectory Tracking using Combined Stochastic Model-Free Position and DDPG-based Attitude Control

Deep reinforcement learning-based multi-objective optimization for electricity–gas–heat integrated energy systems

Design of sliding mode controller for servo feed system based on generalized extended state observer with reinforcement learning

Adaptive MPC path-tracking controller based on reinforcement learning and preview-based PID controller

Adaptive Deep Ant Colony Optimization–Asymmetric Strategy Network Twin Delayed Deep Deterministic Policy Gradient Algorithm: Path Planning for Mobile Robots in Dynamic Environments

Integrating model-driven and data-driven methods for under-frequency load shedding control

Reinforcement Learning-Based Optimal Hull Form Design with Variations in Fore and Aft Parts

A multi-objective reinforcement learning framework for real-time drilling optimization based on symbolic regression and perception

Multi-Agent DRL for Air-to-Ground Communication Planning in UAV-Enabled IoT Networks.

Energy management with adaptive moving average filter and deep deterministic policy gradient reinforcement learning for fuel cell hybrid electric vehicles

Joint Computation Offloading and Trajectory Optimization for Edge Computing UAV: A KNN-DDPG Algorithm

Advanced security measures in coupled phase-shift STAR-RIS networks: A DRL approach

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Deep Deterministic Policy Gradient Algorithm Research Articles

Related Topics

Articles published on Deep Deterministic Policy Gradient Algorithm

A collaborative-learning multi-agent reinforcement learning method for distributed hybrid flow shop scheduling problem

A novel method for solving dynamic flexible job-shop scheduling problem via DIFFormer and deep reinforcement learning

Improved DDPG algorithm-based path planning for unmanned surface vehicles

Decision-Making Policy for Autonomous Vehicles on Highways Using Deep Reinforcement Learning (DRL) Method

TD3-based trajectory optimization for energy consumption minimization in UAV-assisted MEC system

Human-in-the-Loop Reinforcement Learning in Continuous-Action Space.

Peer-to-Peer Energy Transactions for Prosumers Based on Improved Deep Deterministic Policy Gradient Algorithm

The docking control system of an autonomous underwater vehicle combining intelligent object recognition and deep reinforcement learning

Quadrotor Trajectory Tracking using Combined Stochastic Model-Free Position and DDPG-based Attitude Control

Deep reinforcement learning-based multi-objective optimization for electricity–gas–heat integrated energy systems

Design of sliding mode controller for servo feed system based on generalized extended state observer with reinforcement learning

Adaptive MPC path-tracking controller based on reinforcement learning and preview-based PID controller

Adaptive Deep Ant Colony Optimization–Asymmetric Strategy Network Twin Delayed Deep Deterministic Policy Gradient Algorithm: Path Planning for Mobile Robots in Dynamic Environments

Integrating model-driven and data-driven methods for under-frequency load shedding control

Reinforcement Learning-Based Optimal Hull Form Design with Variations in Fore and Aft Parts

A multi-objective reinforcement learning framework for real-time drilling optimization based on symbolic regression and perception

Multi-Agent DRL for Air-to-Ground Communication Planning in UAV-Enabled IoT Networks.

Energy management with adaptive moving average filter and deep deterministic policy gradient reinforcement learning for fuel cell hybrid electric vehicles

Joint Computation Offloading and Trajectory Optimization for Edge Computing UAV: A KNN-DDPG Algorithm

Advanced security measures in coupled phase-shift STAR-RIS networks: A DRL approach