This paper investigates the robust optimal control problem for a class of continuous-time, partially linear, interconnected systems. In addition to the dynamic uncertainties arising from the interconnected dynamics, unknown bounded disturbances are taken into account throughout the learning process, and both the system dynamics and the disturbances are assumed unknown. These challenges render the collected online data imperfect. In this scenario, traditional data-driven control techniques, such as adaptive dynamic programming (ADP) and robust ADP, struggle to approximate the optimal control policy precisely because of imperfect data and computational errors. In this paper, a novel data-driven robust policy iteration method is proposed to solve the robust optimal control problem. The proposed method requires access only to the input and partial state information, without relying on knowledge of the system dynamics, the external disturbances, or the complete state. Based on the small-gain theorem and the notions of strong unboundedness observability and input-to-output stability, it is guaranteed that the learned robust optimal control gain is stabilizing and that the solution of the closed-loop system is uniformly ultimately bounded despite the dynamic uncertainties and unknown external disturbances. Simulation results demonstrate the effectiveness and practicality of the proposed data-driven control method.

Note to Practitioners—This work is motivated by the use of reinforcement learning to improve the design of adaptive optimal controllers for engineering applications. Adaptive dynamic programming methods, in particular policy iteration (PI), are widely used to solve optimal control problems. However, because of the iterative nature of PI, the approximated optimal control policy may be inaccurate, especially when imperfect measurements of the system are used in place of modelling information. This can cause the learned control policy to deviate from the actual optimal policy, and the problem becomes more challenging in the presence of dynamic uncertainties and unknown external disturbances, which corrupt the measurements and result in imperfect data. This work establishes conditions on the uncertainties under which the proposed data-driven PI algorithm is robust to system uncertainties, unknown external disturbances, and imperfect measurements. The approximated robust optimal control policy performs robustly in the presence of imperfect data and uncertainties while remaining close to the optimal control policy.
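For context, the data-driven robust PI method described above builds on the classical policy-iteration scheme for linear-quadratic problems. The sketch below is a minimal model-based version of that scheme (Kleinman's algorithm), assuming full knowledge of the plant matrices (A, B); it is illustrative only and does not reproduce the paper's data-driven, partial-state variant, which replaces the model-based policy-evaluation step with online input and partial-state data and accounts for dynamic uncertainties and disturbances. The function name, the double-integrator plant, and the weighting matrices are assumed for illustration.

```python
import numpy as np
from scipy.linalg import solve_continuous_lyapunov

def kleinman_pi(A, B, Q, R, K0, tol=1e-8, max_iter=50):
    """Model-based policy iteration (Kleinman's algorithm) for the LQR problem.

    Starting from a stabilizing gain K0, each iteration performs policy
    evaluation (a Lyapunov equation) followed by policy improvement,
    converging to the optimal gain K* = R^{-1} B^T P*.
    """
    K = K0
    P_prev = None
    for _ in range(max_iter):
        Ak = A - B @ K
        # Policy evaluation: solve Ak^T P + P Ak + Q + K^T R K = 0
        P = solve_continuous_lyapunov(Ak.T, -(Q + K.T @ R @ K))
        # Policy improvement: K <- R^{-1} B^T P
        K = np.linalg.solve(R, B.T @ P)
        if P_prev is not None and np.linalg.norm(P - P_prev) < tol:
            break
        P_prev = P
    return K, P

# Example usage on a simple double-integrator plant (illustrative values).
A = np.array([[0.0, 1.0], [0.0, 0.0]])
B = np.array([[0.0], [1.0]])
Q = np.eye(2)
R = np.array([[1.0]])
K0 = np.array([[1.0, 1.0]])   # any stabilizing initial gain
K, P = kleinman_pi(A, B, Q, R, K0)
```

In a data-driven setting such as the one considered in the paper, the Lyapunov-equation step above cannot be evaluated directly because A and B are unknown; it is instead replaced by least-squares identities formed from measured input and (partial) state trajectories, which is precisely where imperfect data and disturbances enter the iteration.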