Q-learning Reinforcement Learning Research Articles

As today’s one of the hottest topics, machine learning brings about opportunities in various research areas. Moreover, computational intelligence and metaheuristics open up new strategies, which are shown to be efficient in solving optimization problems. However, studies bringing such remarkable approaches together are still lacking. In this context, the present paper introduces a Q-learning reinforcement learning strategy for binary optimization problems. The developed algorithm works as a reinforcement and recommendation system that evaluates the used algorithms, assigns rewards, promotes or demotes them. Thus, it invokes more promising optimizers more frequently. The proposed Q-learning algorithm uses Particle Swarm Optimization (PSO), Genetic Algorithm and a hybrid of these algorithms, namely, genetic-based PSO (gbPSO) as optimizers. Therefore, it is aimed to avoid local optima by using various optimizers and gathering additional statistical data. Secondarily, all optimizers are further enhanced by adopting an initial solution generation technique and triggered random immigrants mechanism to preserve swarm diversity. In addition to these procedures, a mutation procedure that decreases the diversity is adopted. Thus, more intensified search is encouraged towards the end of search. Moreover, while PSO requires for transfer functions in order to perform in binary spaces, the adopted and further improved gbPSO does not necessarily need such auxiliary procedures. Finally, the performances of all used algorithms are analysed on a recently caught on binary problem, namely, the set-union knapsack problem, which has a wide range of real-life applications. As demonstrated by the comprehensive experimental study and appropriate statistical tests, promising improvements are achieved.

Aiming at the problem that the excitation frequency and resonant frequency of the transducer cannot keep synchronous, the output amplitude decreases and the vibration is unstable. In this study, the working principle of piezoelectric transducers is firstly analyzed by the equivalent circuit method and the instantaneous characteristic variables (installing preload, assembly preload, load, and other factors) that affect the frequency matching obtained to establish the equivalent relationship with the resonant frequency. Secondly, in order to make the synchronization between excitation frequency and resonance frequency, the key instantaneous characteristic variables are extracted based on Pearson correlation coefficient. Thirdly, the mathematical model of the mapping relationships between instantaneous characteristic variables and resonance frequency is established with the radial basis function neural network (RBFNN). Fourthly, for the purpose of the adaptation to the characteristics of dynamic load and real-time frequency modulation in the operation of ultrasonic scalpels, the reinforcement learning (Q-learning algorithm) and the weight vector of RBFNN are used to define the eligibility trace, which is used to dynamically adjust the RBFNN frequency matching optimization model in real time and maintain the “constant” amplitude output and stable harmonic response process. Finally, the experimental results show that, compared with the traditional methods, the frequency matching optimization model of ultrasonic scalpel transducer based on RBF neural network and Q-Learning reinforcement learning is effective, and the vibration amplitude of the transducer is increased by 15.25μm. The amplitude fluctuation is stable at 0.92μm. It can provide decision-making guidance for relevant engineering fields.

Q-learning Reinforcement Learning Research Articles

Related Topics

Articles published on Q-learning Reinforcement Learning

Coordinated control of yaw and roll stability in heavy vehicles considering dynamic safety requirements

Optimisation tool: Q-learning and its application in various fields

AI-Based Q-Learning Approach for Performance Optimization in MIMO-NOMA Wireless Communication Systems

Deep Q-Network Approach for Train Timetable Rescheduling Based on Alternative Graph

Positioning an electric wheelchair in 2D grid map based on natural landmarks for navigation using Q-learning

Improved reinforcement learning path planning algorithm integrating prior knowledge.

Parallel hyper heuristic algorithm based on reinforcement learning for the corridor allocation problem and parallel row ordering problem

A reinforcement learning based computational intelligence approach for binary optimization problems: The case of the set-union knapsack problem

Lightweight Task Coordination of LEO Satellite Cluster based on Distributed Reinforcement Learning

Embedded Learning Approaches in the Whale Optimizer to Solve Coverage Combinatorial Problems

Frequency matching optimization model of ultrasonic scalpel transducer based on neural network and reinforcement learning

Hyperparameter Optimization for the LSTM Method of AUV Model Identification Based on Q-Learning

Discrete Event Modeling and Simulation for Reinforcement Learning System Design

Constrained evolutionary optimization based on reinforcement learning using the objective function and constraints

CLASSQ-L: A Q-Learning Algorithm for Adversarial Real-Time Strategy Games

Integrating Production Planning with Truck-Dispatching Decisions through Reinforcement Learning While Managing Uncertainty

Optimization of thermal comfort, indoor quality, and energy-saving in campus classroom through deep Q learning

Multi-AUV Collaborative Target Recognition Based on Transfer-Reinforcement Learning

Crop Yield Prediction Using Deep Reinforcement Learning Model for Sustainable Agrarian Applications

Comparison of conventional and rapid-acting antidepressants in a rodent probabilistic reversal learning task.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Q-learning Reinforcement Learning Research Articles

Related Topics

Articles published on Q-learning Reinforcement Learning

Coordinated control of yaw and roll stability in heavy vehicles considering dynamic safety requirements

Optimisation tool: Q-learning and its application in various fields

AI-Based Q-Learning Approach for Performance Optimization in MIMO-NOMA Wireless Communication Systems

Deep Q-Network Approach for Train Timetable Rescheduling Based on Alternative Graph

Positioning an electric wheelchair in 2D grid map based on natural landmarks for navigation using Q-learning

Improved reinforcement learning path planning algorithm integrating prior knowledge.

Parallel hyper heuristic algorithm based on reinforcement learning for the corridor allocation problem and parallel row ordering problem

A reinforcement learning based computational intelligence approach for binary optimization problems: The case of the set-union knapsack problem

Lightweight Task Coordination of LEO Satellite Cluster based on Distributed Reinforcement Learning

Embedded Learning Approaches in the Whale Optimizer to Solve Coverage Combinatorial Problems

Frequency matching optimization model of ultrasonic scalpel transducer based on neural network and reinforcement learning

Hyperparameter Optimization for the LSTM Method of AUV Model Identification Based on Q-Learning

Discrete Event Modeling and Simulation for Reinforcement Learning System Design

Constrained evolutionary optimization based on reinforcement learning using the objective function and constraints

CLASSQ-L: A Q-Learning Algorithm for Adversarial Real-Time Strategy Games

Integrating Production Planning with Truck-Dispatching Decisions through Reinforcement Learning While Managing Uncertainty

Optimization of thermal comfort, indoor quality, and energy-saving in campus classroom through deep Q learning

Multi-AUV Collaborative Target Recognition Based on Transfer-Reinforcement Learning

Crop Yield Prediction Using Deep Reinforcement Learning Model for Sustainable Agrarian Applications

Comparison of conventional and rapid-acting antidepressants in a rodent probabilistic reversal learning task.