액터-크리틱 알고리즘을 이용한 리액션 휠 진자의 스윙업 제어

Dae-Won Kim,Hee Jae Park

doi:10.5302/j.icros.2021.21.0057

Abstract

In this study, we verified the performance of the swing-up control method for a reaction wheel pendulum using the actor-critic algorithm in both simulation and experiment and suggested the possibility that reinforcement learning, using shallow neural networks, can be applied to studying intelligent robots that act in real-world environments, such as a robot that teaches itself to walk through trial and error. The actor of the proposed actor-critic algorithm used the policy network to determine the rotational direction of the reaction wheel based on the angular position and velocity of the pendulum and the angular velocity of the reaction wheel. The critic used the value network to estimate the expected reward based on the same factors as the actor’s. In both simulation and in the real-world environment, through trial and error, the proposed algorithm successfully learned how to swing up and stabilize the pendulum by choosing the rotational direction ‒ between the clockwise and counter-clockwise directions ‒ of the reaction wheel.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

액터-크리틱 알고리즘을 이용한 리액션 휠 진자의 스윙업 제어

Abstract

Talk to us

Similar Papers

More From: Journal of Institute of Control, Robotics and Systems

Lead the way for us

Similar Papers

Single-Axis, Spin-to-Spin Slew Maneuvers Under a Finite Jerk Constraint
Donghun Lee ... Young-Joo Song
Journal of Guidance, Control, and Dynamics | VOL. 42
Donghun Lee, et. al.Donghun Lee ... Young-Joo Song
02 May 2019
Journal of Guidance, Control, and Dynamics | VOL. 42

A sandpile model for reliable actor-critic reinforcement learning
Yiming Peng ... Shaoning Pang
-
Yiming Peng, et. al.Yiming Peng ... Shaoning Pang
01 May 2017
01 May 2017

Improved method for precise shaft angle oscillation and angular velocity measurement: (With simultaneous sampling of other analog signals using NI DAQ Cards)
Richard Schreiber
-
Richard SchreiberRichard Schreiber
01 May 2017
01 May 2017

Reward-Punishment Actor-Critic Algorithm Applying to Robotic Non-grasping Manipulation
Taisuke Kobayashi ... Gordon Cheng
-
Taisuke Kobayashi, et. al.Taisuke Kobayashi ... Gordon Cheng
01 Aug 2019
01 Aug 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

액터-크리틱 알고리즘을 이용한 리액션 휠 진자의 스윙업 제어

Abstract

Talk to us

Similar Papers

More From: Journal of Institute of Control, Robotics and Systems