As a new network architecture, the air-ground cooperative network is a key enabler for future 6G networks to achieve ubiquitous connectivity. To effectively relieve the computational pressure of massive data in 6G wireless networks, Unmanned Aerial Vehicles (UAVs) equipped with Mobile Edge Computing (MEC) servers have emerged as a promising technology for providing computing resources to Mobile Devices (MDs). Since a single UAV has limited on-board energy and computational capability, this paper investigates a multi-UAV collaborative MEC architecture. An optimization problem that minimizes the total computational cost is formulated by jointly optimizing the UAV trajectories and the offloading strategies of the MDs. The coupling between the optimization variables and the non-convexity of the problem make it difficult to solve directly. To address these concerns, the non-convex optimization problem is reformulated as a Markov decision process. A UAV-assisted Offloading Strategy based on Reinforcement Learning (UOS-RL) algorithm is proposed to overcome the convergence difficulties caused by the high-dimensional continuous action space. Furthermore, because the environment is highly dynamic, the experience data generated by the agents as they interact with it vary widely in value. Hence, a Priority Experience Replay (PER) mechanism is proposed, which replays experience data according to its priority to improve the training efficiency of the UOS-RL algorithm. Simulation results show that the proposed PER-UOS-RL algorithm outperforms existing schemes in terms of computational cost.
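To make the PER mechanism concrete, the sketch below shows the standard prioritized-replay idea the abstract refers to: transitions are stored with priorities derived from their TD errors and sampled with probability proportional to priority. This is a minimal illustrative sketch, not the paper's implementation; the class name, the capacity, the priority exponent `alpha`, and the use of |TD error| as the priority are conventional PER assumptions, and the importance-sampling correction weights of full PER are omitted for brevity.

```python
import random

class PrioritizedReplayBuffer:
    """Minimal sketch of prioritized experience replay (PER).

    Transitions are sampled with probability proportional to
    priority**alpha, where a transition's priority is typically the
    magnitude of the TD error last observed for it.
    """

    def __init__(self, capacity=10000, alpha=0.6, eps=1e-6):
        self.capacity = capacity  # maximum number of stored transitions
        self.alpha = alpha        # how strongly priorities bias sampling
        self.eps = eps            # keeps every priority strictly positive
        self.buffer = []          # stored (state, action, reward, next_state) tuples
        self.priorities = []      # one priority per stored transition
        self.pos = 0              # next write position (ring buffer)

    def add(self, transition):
        # New transitions get the current maximum priority so each one
        # is guaranteed to be replayed at least once.
        priority = max(self.priorities, default=1.0)
        if len(self.buffer) < self.capacity:
            self.buffer.append(transition)
            self.priorities.append(priority)
        else:
            self.buffer[self.pos] = transition
            self.priorities[self.pos] = priority
        self.pos = (self.pos + 1) % self.capacity

    def sample(self, batch_size):
        # Sampling probability is proportional to priority**alpha, so
        # high-error (informative) transitions are replayed more often.
        weights = [p ** self.alpha for p in self.priorities]
        indices = random.choices(range(len(self.buffer)), weights=weights, k=batch_size)
        return indices, [self.buffer[i] for i in indices]

    def update_priorities(self, indices, td_errors):
        # After a training step, refresh priorities from the new TD errors.
        for i, err in zip(indices, td_errors):
            self.priorities[i] = abs(err) + self.eps
```

In a training loop, the agent would call `add` after each environment step, `sample` to draw a minibatch biased toward high-priority transitions, and `update_priorities` with the resulting TD errors; this biasing toward informative samples is what improves training efficiency in highly dynamic environments.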