Classical Reinforcement Learning Research Articles

In recent years, the proliferation of Massive Open Online Courses (MOOC) platforms on a global scale has been remarkable. Learners can now meet their learning demands with the help of MOOC. However, learners might not understand the course material well if they have access to a lot of information due to their inadequate expertise and cognitive ability. Personalized Recommender Systems (RSs), a cutting-edge technology, can assist in addressing this issue. It greatly increases resource acquisition through personalized availability for various people of all ages. Intelligent learning methods, such as machine learning and Reinforcement Learning (RL) can be used in RS challenges. However, machine learning needs supervised data and classical RL is not suitable for multi-task recommendations in online learning platforms. To address these challenges, the proposed framework integrates a Deep Reinforcement Learning (DRL) and multi-agent approach. This adaptive system personalizes the learning experience by considering key factors such as learner sentiments, learning style, preferences, competency, and adaptive difficulty levels. We formulate the interactive RS problem using a DRL-based Actor-Critic model named DRR, treating recommendations as a sequential decision-making process. The DRR enables the system to provide top-N course recommendations and personalized learning paths, enriching the student's experience. Extensive experiments on a MOOC dataset such as the 100 K Coursera course review validate the proposed DRR model, demonstrating its superiority over baseline models in major evaluation metrics for long-term recommendations. The outcomes of this research contribute to the field of e-learning technology, guiding the design and implementation of course RSs, to facilitate personalized and relevant recommendations for online learning students.

Read full abstract

The application of drones carrying different devices for aerial hovering operations is becoming increasingly widespread, but currently there is very little research relying on reinforcement learning methods for hovering control, and it has not been implemented on physical machines. Drone’s behavior space regarding hover control is continuous and large-scale, making it difficult for basic algorithms and value-based reinforcement learning (RL) algorithms to have good results. In response to this issue, this article applies a watcher-actor-critic (WAC) algorithm to the drone’s hover control, which can quickly lock the exploration direction and achieve high robustness of the drone’s hover control while improving learning efficiency and reducing learning costs. This article first utilizes the actor-critic algorithm based on behavioral value Q (QAC) and the deep deterministic policy gradient algorithm (DDPG) for drone hover control learning. Subsequently, an actor-critic algorithm with an added watcher is proposed, in which the watcher uses a PID controller with parameters provided by a neural network as the dynamic monitor, transforming the learning process into supervised learning. Finally, this article uses a classic reinforcement learning environment library, Gym, and a current mainstream reinforcement learning framework, PARL, for simulation, and deploys the algorithm to a practical environment. A multi-sensor fusion strategy-based autonomous localization method for unmanned aerial vehicles is used for practical exercises. The simulation and experimental results show that the training episodes of WAC are reduced by 20% compared to the DDPG and 55% compared to the QAC, and the proposed algorithm has a higher learning efficiency, faster convergence speed, and smoother hovering effect compared to the QAC and DDPG.

Read full abstract

Classical Reinforcement Learning Research Articles

Related Topics

Articles published on Classical Reinforcement Learning

The Effect of Hyperparameters on the Model Convergence Rate of Cliff Walking Problem Based on Q-Learning

A Multi-Strategy Co-Evolutionary Particle Swarm Optimization Algorithm with Its Convergence Analysis

Risk-averse supply chain management via robust reinforcement learning

Dopamine release plateau and outcome signals in dorsal striatum contrast with classic reinforcement learning formulations

Decision Transformer-Based Efficient Data Offloading in LEO-IoT.

Design and analysis of parallel quantum transfer fractal priority replay with dynamic memory algorithm in quantum reinforcement learning for robotics

Ex-RL: Experience-based reinforcement learning

Humans forage for reward in reinforcement learning tasks.

An Improved Dyna-Q Algorithm Inspired by the Forward Prediction Mechanism in the Rat Brain for Mobile Robot Path Planning.

Co-evolutionary traffic signal control using reinforcement learning for road networks under stochastic capacity

An adaptable and personalized framework for top-N course recommendations in online learning

UISA: User Information Separating Architecture for Commodity Recommendation Policy with Deep Reinforcement Learning

OCEAN-MBRL: Offline Conservative Exploration for Model-Based Offline Reinforcement Learning

BadRL: Sparse Targeted Backdoor Attack against Reinforcement Learning

A Comprehensive Analysis of Game theory on Multi-Agent Reinforcement

Hybrid actor-critic algorithm for quantum reinforcement learning at CERN beam lines

Research advanced in the integration of federated learning and reinforcement learning

A Supervised Reinforcement Learning Algorithm for Controlling Drone Hovering

GREEN PATH: an expert system for space planning and design by the generation of human trajectories

Distributional reinforcement learning in prefrontal cortex

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Classical Reinforcement Learning Research Articles

Related Topics

Articles published on Classical Reinforcement Learning

The Effect of Hyperparameters on the Model Convergence Rate of Cliff Walking Problem Based on Q-Learning

A Multi-Strategy Co-Evolutionary Particle Swarm Optimization Algorithm with Its Convergence Analysis

Risk-averse supply chain management via robust reinforcement learning

Dopamine release plateau and outcome signals in dorsal striatum contrast with classic reinforcement learning formulations

Decision Transformer-Based Efficient Data Offloading in LEO-IoT.

Design and analysis of parallel quantum transfer fractal priority replay with dynamic memory algorithm in quantum reinforcement learning for robotics

Ex-RL: Experience-based reinforcement learning

Humans forage for reward in reinforcement learning tasks.

An Improved Dyna-Q Algorithm Inspired by the Forward Prediction Mechanism in the Rat Brain for Mobile Robot Path Planning.

Co-evolutionary traffic signal control using reinforcement learning for road networks under stochastic capacity

An adaptable and personalized framework for top-N course recommendations in online learning

UISA: User Information Separating Architecture for Commodity Recommendation Policy with Deep Reinforcement Learning

OCEAN-MBRL: Offline Conservative Exploration for Model-Based Offline Reinforcement Learning

BadRL: Sparse Targeted Backdoor Attack against Reinforcement Learning

A Comprehensive Analysis of Game theory on Multi-Agent Reinforcement

Hybrid actor-critic algorithm for quantum reinforcement learning at CERN beam lines

Research advanced in the integration of federated learning and reinforcement learning

A Supervised Reinforcement Learning Algorithm for Controlling Drone Hovering

GREEN PATH: an expert system for space planning and design by the generation of human trajectories

Distributional reinforcement learning in prefrontal cortex