Double Inverted Pendulum Research Articles

Inverted pendulums are widely used to test control theories because they are typical unstable systems and interesting objects of study. However, few innovative methods for the adaptive control of the inverted pendulum (IP) have been reported. This paper presents a stabilization method that uses adaptive control for a serial rotary-type double inverted pendulum (SRDIP) whose basic parameters are all unknown. The control system of the SRDIP is realized by dividing control into two stages. The first stage is an adaptive control mode for the single IP, with the second IP hanging downward; the second stage is an LQ control mode for the SRDIP. The control system uses two kinds of adaptive controllers: a variable structure system (VSS) robust adaptive controller and a self-tuning controller (STC). The rotational angle of the first IP is stabilized by the VSS adaptive control, and the rotary arm is stabilized by constructing the STC, which guarantees the boundary reference angle of the first IP. However, it is difficult to construct the STC from the adjustable parameters of the VSS adaptive control system alone. All basic parameters of the SRDIP are therefore estimated with the recursive least squares (RLS) method so that both the STC system and the LQ design of the SRDIP can be completed. The RLS algorithm is run by superposing a suitable perturbation signal on the adaptive manipulated variable over a limited short interval. The STC system updates an LQ controller based on QR methods devised for real-time operation. Before the first stage is completed, an LQ controller for the SRDIP is obtained through a state-space description built from the estimated basic parameters. In the second stage, the control law is switched from the adaptive control mode to LQ control of the SRDIP. Finally, simulation studies and practical experiments verify that the proposed system is useful as a control strategy for an SRDIP with unknown parameters.
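The role of the perturbation signal in RLS estimation can be illustrated with a minimal sketch. It is not the paper's estimator: the first-order plant y[k+1] = a*y[k] + b*u[k], the feedback gain, the random binary perturbation, and the names `rls_update` and `identify` are all assumptions chosen only to keep the example small.

```python
import random

def rls_update(theta, P, phi, y, lam=1.0):
    """One recursive least squares step for the model y = phi^T theta."""
    n = len(theta)
    # P is symmetric, so P*phi also serves as (phi^T P) transposed.
    P_phi = [sum(P[i][j] * phi[j] for j in range(n)) for i in range(n)]
    denom = lam + sum(phi[i] * P_phi[i] for i in range(n))
    gain = [v / denom for v in P_phi]
    err = y - sum(phi[i] * theta[i] for i in range(n))
    theta = [theta[i] + gain[i] * err for i in range(n)]
    P = [[(P[i][j] - gain[i] * P_phi[j]) / lam for j in range(n)]
         for i in range(n)]
    return theta, P

def identify(a_true=0.9, b_true=0.5, steps=200, seed=0):
    """Estimate (a, b) of a hypothetical plant y[k+1] = a*y[k] + b*u[k]."""
    rng = random.Random(seed)
    theta = [0.0, 0.0]                  # initial parameter guess
    P = [[1e6, 0.0], [0.0, 1e6]]        # large initial covariance
    y = 0.0
    for _ in range(steps):
        # Feedback input (assumed gain) plus a random binary perturbation.
        u = -0.2 * y + rng.choice([-1.0, 1.0])
        y_next = a_true * y + b_true * u
        theta, P = rls_update(theta, P, [y, u], y_next)
        y = y_next
    return theta

a_hat, b_hat = identify()
```

Without the perturbation term, u would be proportional to y at every step, the regressor [y, u] would always point in the same direction, and a and b could not be separated. This is the usual reason an excitation signal is superposed on the control input during identification.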

The stabilization control of nonholonomic systems has been extensively studied because it is essential for nonholonomic robot control problems. The difficulty in this problem is that a theoretical derivation of the control policy is not guaranteed to be achievable. In this paper, we present a reinforcement learning (RL) method with instance-based policy (IBP) representation, in which control policies for this class of systems are optimized with respect to user-defined cost functions. Direct policy search (DPS) is an approach to RL in which the policy is represented by a parametric model and the model parameters are searched directly by optimization techniques, including genetic algorithms (GAs). In IBP representation, an instance consists of a state-action pair, and a policy consists of a set of instances. Several DPS methods with IBP have been proposed previously. These methods sometimes fail to obtain optimal control policies when the state-action variables are continuous. In this paper, we present a real-coded GA for DPS with IBP. Our method is specifically designed for continuous domains. Optimization of IBP has three difficulties: high dimensionality, epistasis, and multimodality. Our solution is designed to overcome these difficulties. Policy search with IBP representation appears to be a high-dimensional optimization problem; however, the instances that can improve the fitness are often limited to active instances (instances used for the evaluation), and the number of active instances is small. Therefore, we treat the search as a low-dimensional problem by restricting the search variables to active instances. It is commonly known that functions with epistasis can be optimized efficiently with crossovers that satisfy the inheritance of statistics. For efficient search of IBP, we propose an extended crossover-like mutation (extended XLM), which generates a new instance around an existing instance while satisfying the inheritance of statistics. To overcome multimodality, we propose extended CCM for selection. Extended CCM always chooses the child for the next generation from among the children and the parent that generated them. By doing so, the diversity of the population is expected to be well maintained. Our proposal, FLIP (Functionally sophisticated Learner for IBP), consists of extended XLM and extended CCM. The effectiveness of FLIP is shown by experiments on nonholonomic control problems: a space robot, a car-like robot, and a parallel-type double inverted pendulum.
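The notions of an instance-based policy and of active instances can be sketched as follows. The abstract does not say how an action is selected from the instance set, so the nearest-neighbor rule below is an assumption, as are the class name `InstancePolicy` and the toy instances; this is not the FLIP algorithm itself.

```python
import math

class InstancePolicy:
    """A policy stored as a set of (state, action) instances."""

    def __init__(self, instances):
        self.instances = list(instances)   # [(state, action), ...]
        self.active = set()                # indices used during evaluation

    def act(self, state):
        # Assumed selection rule: return the action of the nearest instance.
        idx = min(range(len(self.instances)),
                  key=lambda i: math.dist(self.instances[i][0], state))
        self.active.add(idx)               # mark this instance as active
        return self.instances[idx][1]

# Hypothetical two-dimensional states with one-dimensional actions.
policy = InstancePolicy([
    ((0.0, 0.0), (1.0,)),
    ((1.0, 1.0), (-1.0,)),
    ((5.0, 5.0), (0.0,)),
])
first_action = policy.act((0.1, 0.2))
second_action = policy.act((0.9, 0.8))
```

After these two queries only instances 0 and 1 are marked active. Instance 2 never influenced the evaluation, so a search restricted to active instances would leave it alone, which is how the effective dimensionality of the problem shrinks.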

Articles published on Double Inverted Pendulum

Research of DD2UD with Human Simulating Intelligent Control for a Double Inverted Pendulum

Hybrid NN predictive-based LQR controller for rotary double inverted pendulum systems: an analytical study

Vibration Suppression of Two-Wheel Mobile Manipulator Using Resonance-Ratio-Control-Based Null-Space Control

Reinforcement learning using swarm intelligence-trained neural networks

Watching quiet human stance to shake off its straitjacket

A Robust Control of Two-Wheeled Mobile Manipulator with Underactuated Joint by Nonlinear Backstepping Method

A simple new device to examine human stance: the totter-slab

A Two-Stage Control System for a Rotary-Type Double Inverted Pendulum Using Adaptive Controllers

Decentralized Sliding Mode Controller Design for Large-Scale Systems with Polytopic Models

PID-Like Neural Network Nonlinear Adaptive Control for Uncertain Multivariable Motion Control Systems

Mechatronic System Design Applied to Double Inverted Pendulum with Flywheel Actuator

Non-linear control of under-actuated mechanical systems

Instance-based Policy Learning by Real-coded Genetic Algorithms and Its Application to Control of Nonholonomic Systems

Global stabilization of a double inverted pendulum with control at the hinge between the links

Discrete-Time Control of Linear Time-Periodic Systems

Stabilization of linear undamped systems via position and delayed position feedbacks

Novel active fault-tolerant control scheme and its application to a double inverted pendulum system

Analysis and experiment on simultaneous swing‐up of a parallel cart‐type double inverted pendulum

Stability Improvement of Two Wheel Driven Mobile Manipulator Using Nonlinear PD Controller

LMI based output-feedback controllers: γ-optimal versus linear quadratic
