Internal Reinforcement Signal Research Articles

This paper proposes a TD (temporal difference) and GA (genetic algorithm)-based reinforcement (TDGAR) learning method and applies it to the control of a real magnetic bearing system. The TDGAR learning scheme is a new hybrid GA, which integrates the TD prediction method and the GA to perform the reinforcement learning task. The TDGAR learning system is composed of two integrated feedforward networks. One neural network acts as a critic network to guide the learning of the other network (the action network) which determines the outputs (actions) of the TDGAR learning system. The action network can be a normal neural network or a neural fuzzy network. Using the TD prediction method, the critic network can predict the external reinforcement signal and provide a more informative internal reinforcement signal to the action network. The action network uses the GA to adapt itself according to the internal reinforcement signal. The key concept of the TDGAR learning scheme is to formulate the internal reinforcement signal as the fitness function for the GA such that the GA can evaluate the candidate solutions (chromosomes) regularly, even during periods without external feedback from the environment. This enables the GA to proceed to new generations regularly without waiting for the arrival of the external reinforcement signal. This can usually accelerate the GA learning since a reinforcement signal may only be available at a time long after a sequence of actions has occurred in the reinforcement learning problem. The proposed TDGAR learning system has been used to control an active magnetic bearing (AMB) system in practice. A systematic design procedure is developed to achieve successful integration of all the subsystems including magnetic suspension, mechanical structure, and controller training. The results show that the TDGAR learning scheme can successfully find a neural controller or a neural fuzzy controller for a self-designed magnetic bearing system.

A genetic reinforcement neural network (GRNN) is proposed to solve various reinforcement learning problems. The proposed GRNN is constructed by integrating two feedforward multilayer networks. One neural network acts as an action network for determining the outputs (actions) of the GRNN, and the other as a critic network to help the learning of the action network. Using the temporal difference prediction method, the critic network can predict the external reinforcement signal and provide a more informative internal reinforcement signal to the action network. The action network uses the genetic algorithm (GA) to adapt itself according to the internal reinforcement signal. The key concept of the proposed GRNN learning scheme is to formulate the internal reinforcement signal as the fitness function for the GA. This learning scheme forms a novel hybrid GA, which consists of the temporal difference and gradient descent methods for the critic network learning, and the GA for the action network learning. By using the internal reinforcement signal as the fitness function, the GA can evaluate the candidate solutions (chromosomes) regularly, even during the period without external reinforcement feedback from the environment. Hence, the GA can proceed to new generations regularly without waiting for the arrival of the external reinforcement signal. This can usually accelerate the GA learning because a reinforcement signal may only be available at a lime long after a sequence of actions has occurred in the reinforcement learning problems. Computer simulations have been conducted to illustrate the performance and applicability of the proposed learning scheme.

Internal Reinforcement Signal Research Articles

Related Topics

Articles published on Internal Reinforcement Signal

Internal reinforcement adaptive dynamic programming for optimal containment control of unknown continuous-time multi-agent systems

The Boundedness Conditions for Model-Free HDP( λ ).

Fuzzy-Based Goal Representation Adaptive Dynamic Programming

Gr-GDHP: A New Architecture for Globalized Dual Heuristic Dynamic Programming.

A Theoretical Foundation of Goal Representation Heuristic Dynamic Programming.

A three-network architecture for on-line learning and optimization based on adaptive dynamic programming

Radial basis function neural network-based adaptive critic control of induction motors

REINFORCEMENT LEARNING OF FUZZY LOGIC CONTROLLERS FOR QUADRUPED

GA-based fuzzy reinforcement learning for control of a magnetic bearing system

Controlling chaos by GA-based reinforcement learning neural network

A parallel fuzzy inference model with distributed prediction scheme for reinforcement learning

GA-based reinforcement learning for neural networks

Neuro-Resistive Grid approach to trainable controllers: A pole balancing example

Reinforcement learning for an ART-based fuzzy adaptive learning control network

Reinforcement structure/parameter learning for neural-network-based fuzzy logic control systems

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Internal Reinforcement Signal Research Articles

Related Topics

Articles published on Internal Reinforcement Signal

Internal reinforcement adaptive dynamic programming for optimal containment control of unknown continuous-time multi-agent systems

The Boundedness Conditions for Model-Free HDP( λ ).

Fuzzy-Based Goal Representation Adaptive Dynamic Programming

Gr-GDHP: A New Architecture for Globalized Dual Heuristic Dynamic Programming.

A Theoretical Foundation of Goal Representation Heuristic Dynamic Programming.

A three-network architecture for on-line learning and optimization based on adaptive dynamic programming

Radial basis function neural network-based adaptive critic control of induction motors

REINFORCEMENT LEARNING OF FUZZY LOGIC CONTROLLERS FOR QUADRUPED

GA-based fuzzy reinforcement learning for control of a magnetic bearing system

Controlling chaos by GA-based reinforcement learning neural network

A parallel fuzzy inference model with distributed prediction scheme for reinforcement learning

GA-based reinforcement learning for neural networks

Neuro-Resistive Grid approach to trainable controllers: A pole balancing example

Reinforcement learning for an ART-based fuzzy adaptive learning control network

Reinforcement structure/parameter learning for neural-network-based fuzzy logic control systems