Discounted Value Function Research Articles


30 Articles

Published in last 50 years

Articles published on Discounted Value Function


Lifelong reinforcement learning tracking control of nonlinear strict-feedback systems using multilayer neural networks with constraints

This paper presents a novel safe integral reinforcement learning (IRL)-based optimal trajectory tracking scheme for nonlinear systems with uncertain dynamics that are subject to constraints. We use multilayer neural networks (MNNs) as the actor and critic, along with an NN identifier in the backstepping process, to minimize a discounted value function. A time-varying barrier Lyapunov function (TVBLF) handles the constraints and provides safety assurances. Online weight update laws for the actor and critic MNNs are derived, driven by the Bellman error and the control input error. We introduce an online lifelong learning (LL) method in the critic NN that uses the Bellman error in the MNNs to address catastrophic forgetting. The method’s effectiveness is demonstrated through simulations of multitask tracking on a mobile robot. The paper concludes with a stability analysis of the closed-loop system.

Neurocomputing

Jul 8, 2024
Irfan Ganie + 1
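
The abstract above refers to a discounted value function and a Bellman error without stating them. As a rough illustration only, and not the paper's method, the sketch below evaluates both for a scalar linear closed-loop system. Everything in it is an assumption made for illustration: the dynamics x' = a_cl * x, the quadratic running cost c * x^2, the discount convention exp(-gamma * t), the candidate value function p_hat * x^2, and the function names `discounted_cost` and `integral_bellman_error`.

```python
import math


def discounted_cost(x0, a_cl, c, gamma, horizon, steps=20000):
    """Trapezoidal estimate of the integral over [0, horizon] of
    exp(-gamma * t) * c * x(t)**2, where x(t) = x0 * exp(a_cl * t)
    solves the (assumed) closed-loop system x' = a_cl * x."""
    dt = horizon / steps
    total = 0.0
    for i in range(steps + 1):
        t = i * dt
        x = x0 * math.exp(a_cl * t)
        weight = 0.5 if i in (0, steps) else 1.0  # trapezoid end points
        total += weight * math.exp(-gamma * t) * c * x * x
    return total * dt


def integral_bellman_error(p_hat, x0, a_cl, c, gamma, T):
    """Integral (IRL-style) Bellman residual over one interval of length T
    for the candidate value function V_hat(x) = p_hat * x**2:
    running cost + discounted value at t+T - value at t."""
    x_T = x0 * math.exp(a_cl * T)
    running = discounted_cost(x0, a_cl, c, gamma, T)
    return running + math.exp(-gamma * T) * p_hat * x_T ** 2 - p_hat * x0 ** 2


# Hypothetical numbers: open loop a = -1, input gain b = 1, feedback u = -k x
# with k = 1, cost weights q = R = 1, so a_cl = -2 and c = q + R * k**2 = 2.
a_cl, c, gamma = -2.0, 2.0, 0.5

# Closed form of the discounted value for this toy problem: V(x) = p_true * x**2.
p_true = c / (gamma - 2.0 * a_cl)

err_true = integral_bellman_error(p_true, 1.0, a_cl, c, gamma, T=0.1)
err_wrong = integral_bellman_error(1.0, 1.0, a_cl, c, gamma, T=0.1)
print(f"residual with true value function:  {err_true:.2e}")
print(f"residual with wrong value function: {err_wrong:.2e}")
```

In this toy setting the residual vanishes (up to quadrature error) only when the candidate matches the true discounted value function, which is the property a critic update driven by the Bellman error relies on. The paper itself uses multilayer networks, an identifier, and a barrier Lyapunov function, none of which are modeled here.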



Articles published on Discounted Value Function

Lifelong reinforcement learning tracking control of nonlinear strict-feedback systems using multilayer neural networks with constraints

Optimal trajectory tracking of uncertain nonlinear continuous-time strict-feedback systems with dynamic constraints

Robust Average-Reward Reinforcement Learning

Continual online learning-based optimal tracking control of nonlinear strict-feedback systems: application to unmanned aerial vehicles

Optimal Learning Output Tracking Control: A Model-Free Policy Optimization Method With Convergence Analysis

Markov decision processes under risk sensitivity: A discount vanishing approach

Robust Average-Reward Markov Decision Processes

Space manipulator optimal impedance control using integral reinforcement learning

Homogenization for sub-Riemannian Lagrangians

Robust tracking control with reinforcement learning for nonlinear-constrained systems

Multi-agent Q-Learning control of spacecraft formation flying reconfiguration trajectories

Event-Triggered ADP for Tracking Control of Partially Unknown Constrained Uncertain Systems

Adaptive dynamic programming-based event-triggered optimal tracking control

Average Cost Optimality in Partially Observed Lost-Sales Inventory Systems

Generalized value iteration for discounted optimal control with stability analysis

H∞ Tracking Control for Linear Discrete-Time Systems: Model-Free Q-Learning Designs

Model-free Optimal Tracking Control for an Aircraft Skin Inspection Robot with Constrained-input and Input Time-delay via Integral Reinforcement Learning

Stochastic Setup-Cost Inventory Model with Backorders and Quasiconvex Cost Functions

The Vanishing Discount Approach in a class of Zero-Sum Finite Games with Risk-Sensitive Average Criterion

Vanishing discount approximations in controlled Markov chains with risk-sensitive average criterion