Traditional Reinforcement Learning Algorithms Research Articles

To operate safely in a dynamic environment, autonomous vehicles (AVs) must match the predictive driving abilities of human drivers and be capable of anticipating the future actions of other dynamic objects in the environment, especially neighboring vehicles. Developing safe AVs is a challenging task, as it requires algorithms that can make real-time decisions in unpredictable circumstances. Reinforcement learning (RL) presents a promising approach for AV control, as it learns decision-making policies through trial and error. However, traditional RL algorithms are unsuitable for safety-critical applications, as they may explore unsafe actions, potentially resulting in accidents. Safe reinforcement learning (SRL) algorithms have been developed to address this issue, prioritizing safe and reliable decisions. These algorithms incorporate constraints to prevent unsafe actions, or use techniques such as uncertainty and risk estimation and penalty functions to avoid actions deemed excessively risky. SRL methods are critical for the general adoption of autonomous vehicles because they help ensure safety and reliability, and they have the potential to significantly reduce accidents and build public trust in autonomous driving. Challenges remain, however: the dynamic nature of real-world traffic, high computational costs, and the diversity of road designs make designing, testing, and validating SRL algorithms difficult. Despite these challenges, SRL, integrated with new sensing technologies and machine learning techniques, presents a promising route to safe, efficient, and environmentally friendly transportation systems.
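
The two SRL mechanisms the abstract names, constraints that rule out unsafe actions and penalty functions driven by a risk estimate, can be illustrated with a minimal sketch. This is not the reviewed paper's method: the function names, the `is_safe` flags, the risk threshold, and the penalty weight are all hypothetical and chosen only for illustration.

```python
import random


def safe_epsilon_greedy(q_values, is_safe, epsilon, rng=random):
    """Epsilon-greedy action selection restricted to actions flagged safe.

    q_values: list of floats, one estimated value per action.
    is_safe:  list of bools of the same length (output of a hypothetical
              safety check; how it is computed is outside this sketch).
    """
    allowed = [a for a, ok in enumerate(is_safe) if ok]
    if not allowed:
        # No safe action: a real system would hand over to a fallback controller.
        raise ValueError("no safe action available")
    if rng.random() < epsilon:
        return rng.choice(allowed)  # explore, but only among safe actions
    return max(allowed, key=lambda a: q_values[a])  # exploit among safe actions


def penalized_reward(reward, risk, risk_threshold=0.2, penalty_weight=10.0):
    """Subtract a penalty proportional to the risk in excess of a threshold.

    risk is an assumed scalar risk estimate in [0, 1]; the threshold and
    weight are illustrative values, not taken from any paper.
    """
    excess = max(0.0, risk - risk_threshold)
    return reward - penalty_weight * excess
```

With `epsilon=0.0`, `safe_epsilon_greedy([1.0, 5.0, 3.0], [True, False, True], 0.0)` picks action 2: action 1 has the highest value but is masked out as unsafe. Masking enforces a hard constraint during exploration, while the penalty only discourages risky behaviour through the reward signal, so the two are often combined.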


To address issues with traditional ant colony and reinforcement learning algorithms, such as low search efficiency, insufficiently smooth paths, and a tendency to fall into local optima, this paper designs an improved ant colony optimization algorithm fused with an improved Q-Learning algorithm (IAC-IQL) for Bessel curve global path planning of search and rescue (SAR) robots. First, the heuristic function model in the ant colony algorithm is improved, an elite ant search strategy and an adaptive pheromone volatility factor strategy are introduced, the initial path is searched in the motion environment using the improved ant colony algorithm, and the initialized pheromone matrix is constructed. Second, the improved ant colony algorithm and the Q-Learning (QL) algorithm are fused by exploiting the similarity between the pheromone matrix in the improved ant colony algorithm and the Q-matrix in the QL algorithm. A heuristic learning evaluation model is designed to dynamically adjust the learning factor and provide guidance for the search path. Additionally, a dynamic adaptive greedy strategy is introduced to balance the robot's exploration and exploitation of the environment. Finally, the paths are smoothed using third-order Bessel curves to eliminate excessive steering angles. Three sets of comparative simulation experiments conducted on the PyCharm platform verified the effectiveness, superiority, and practicality of the IAC-IQL algorithm. The experimental results demonstrated that the IAC-IQL algorithm integrates the strong search capability of the ant colony algorithm with the self-learning characteristics of the QL algorithm. SAR robots equipped with the IAC-IQL algorithm exhibit significantly enhanced iterative search efficiency in both the grid simulation environment and the image sampling simulation environment. The global path optimization indicators demonstrate high efficiency, and the paths are smoother.
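
The pipeline described above (pheromone matrix used to seed the Q-matrix, an exploration rate that adapts over time, and third-order curve smoothing) can be sketched as follows. The abstract does not give the paper's formulas, so everything here is an assumption made for illustration: the row-normalisation used for seeding, the exponential epsilon decay, the standard tabular Q-Learning update, and the reading of "third-order Bessel curves" as cubic Bézier curves. All function names and default values are hypothetical.

```python
def q_from_pheromone(pheromone, scale=1.0):
    """Seed a Q-table from a pheromone matrix by normalising each row.

    pheromone[s][a] is the pheromone level for taking action a in state s.
    Row-normalisation is an illustrative choice, not the paper's rule.
    """
    q = []
    for row in pheromone:
        peak = max(row)
        q.append([scale * v / peak if peak > 0 else 0.0 for v in row])
    return q


def decayed_epsilon(episode, eps_start=0.9, eps_end=0.05, decay=0.99):
    """Exploration rate that shrinks with the episode index (assumed schedule)."""
    return max(eps_end, eps_start * decay ** episode)


def q_update(q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """Standard tabular Q-Learning update, applied in place."""
    target = r + gamma * max(q[s_next])
    q[s][a] += alpha * (target - q[s][a])


def cubic_bezier(p0, p1, p2, p3, n=20):
    """Sample n + 1 points on the cubic Bezier curve with control points p0..p3."""
    points = []
    for i in range(n + 1):
        t = i / n
        u = 1.0 - t
        # Bernstein basis polynomials of degree 3.
        b0, b1, b2, b3 = u ** 3, 3 * u * u * t, 3 * u * t * t, t ** 3
        x = b0 * p0[0] + b1 * p1[0] + b2 * p2[0] + b3 * p3[0]
        y = b0 * p0[1] + b1 * p1[1] + b2 * p2[1] + b3 * p3[1]
        points.append((x, y))
    return points
```

Seeding the Q-table this way means the Q-Learning stage starts from the routes the ant colony stage already favoured rather than from zeros, which is the intuition behind fusing the two matrices. The smoothed curve passes through `p0` and `p3` and is only pulled toward `p1` and `p2`, so sharp corners in a grid path are rounded off.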


Articles published on Traditional Reinforcement Learning Algorithms

Noisy Dueling Double Deep Q-Network algorithm for autonomous underwater vehicle path planning

A comprehensive review on safe reinforcement learning for autonomous vehicle control in dynamic environments

Improved ACO algorithm fused with improved Q-Learning algorithm for Bessel curve global path planning of search and rescue robots

A train trajectory optimization method based on the safety reinforcement learning with a relaxed dynamic reward

A Multiproject and Multilevel Plan Management Model Based on a Hybrid Program Evaluation and Review Technique and Reinforcement Learning Mechanism

Mars Exploration: Research on Goal-Driven Hierarchical DQN Autonomous Scene Exploration Algorithm

Optimization Effectiveness of Multi-Intelligence Consistent Energy System Based on Digital Advertising and Data Analysis

Optimization of Single-user Task Migration based on Improved DDPG

An obstacle avoidance method for robotic arm based on reinforcement learning

Federated deep reinforcement learning for task offloading and resource allocation in mobile edge computing-assisted vehicular networks

A review of research on reinforcement learning algorithms for multi-agents

An Improved Dyna-Q Algorithm Inspired by the Forward Prediction Mechanism in the Rat Brain for Mobile Robot Path Planning

Evaluating the stealth of reinforcement learning-based cyber attacks against unknown scenarios using knowledge transfer techniques

Energy Management Strategy Based on Reinforcement Learning and Frequency Decoupling for Fuel Cell Hybrid Powertrain

Advancing spacecraft rendezvous and docking through safety reinforcement learning and ubiquitous learning principles

Learning strategies for underwater robot autonomous manipulation control

Duty Cycle Scheduling in Wireless Sensor Networks Using an Exploratory Strategy-Directed MADDPG Algorithm

Optimization of Image Transmission in Cooperative Semantic Communication Networks

Electric vehicle charging navigation strategy in coupled smart grid and transportation network: A hierarchical reinforcement learning approach

Successful application of predictive information in deep reinforcement learning control: A case study based on an office building HVAC system
