Monte Carlo tree search control scheme for multibody dynamics applications

Yixuan Tang,Aki Mikkola,Grzegorz Orzechowski,Aleš Prokop

doi:10.1007/s11071-024-09509-8

Abstract

There is considerable interest in applying reinforcement learning (RL) to improve machine control across multiple industries, and the automotive industry is one of the prime examples. Monte Carlo Tree Search (MCTS) has emerged and proven powerful in decision-making games, even without understanding the rules. In this study, multibody system dynamics (MSD) control is first modeled as a Markov Decision Process and solved with Monte Carlo Tree Search. Based on randomized search space exploration, the MCTS framework builds a selective search tree by repeatedly applying a Monte Carlo rollout at each child node. However, without a library of available choices, deciding among the many possibilities for agent parameters can be intimidating. In addition, the MCTS poses a significant challenge for searching due to the large branching factor. This challenge is typically overcome by appropriate parameter design, search guiding, action reduction, parallelization, and early termination. To address these shortcomings, the overarching goal of this study is to provide needed insight into inverted pendulum controls via vanilla and modified MCTS agents, respectively. A series of reward functions are well-designed according to the control goal, which maps a specific distribution shape of reward bonus and guides the MCTS-based control to maintain the upright position. Numerical examples show that the reward-modified MCTS algorithms significantly improve the control performance and robustness of the default choice of a constant reward that constitutes the vanilla MCTS. The exponentially decaying reward functions perform better than the constant value or polynomial reward functions. Moreover, the exploitation vs. exploration trade-off and discount parameters are carefully tested. The study’s results can guide the research of RL-based MSD users.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Monte Carlo tree search control scheme for multibody dynamics applications

Abstract

Talk to us

Similar Papers

More From: Nonlinear Dynamics

Lead the way for us

Journal: Nonlinear Dynamics	Publication Date: Apr 3, 2024
License type: CC BY 4.0

Similar Papers

Reinforcement Learning for the Agile Earth-Observing Satellite Scheduling Problem
Adam Herrmann ... Hanspeter Schaub
IEEE Transactions on Aerospace and Electronic Systems | VOL. -
Adam Herrmann, et. al.Adam Herrmann ... Hanspeter Schaub
01 Jan 2023
IEEE Transactions on Aerospace and Electronic Systems | VOL. -

EXPLORING THE LIMITS OF MCTS IN PAC-MAN: MAZE SIZE, SIMULATIONS, AND PERFORMANCE
Artem Novikov ... Volodymyr Yanovsky
Herald of Khmelnytskyi National University. Technical sciences | VOL. 341
Artem Novikov, et. al.Artem Novikov ... Volodymyr Yanovsky
31 Oct 2024
Herald of Khmelnytskyi National University. Technical sciences | VOL. 341

Adaptive Reward for CAV Action Planning using Monte Carlo Tree Search
Dhruvkumar Patel ... Rym Zalila-Wenkstern
-
Dhruvkumar Patel, et. al.Dhruvkumar Patel ... Rym Zalila-Wenkstern
19 Sep 2021
19 Sep 2021

AlphaTruss: Monte Carlo Tree Search for Optimal Truss Layout Design
Ruifeng Luo ... Xianzhong Zhao
Buildings | VOL. 12
Ruifeng Luo, et. al.Ruifeng Luo ... Xianzhong Zhao
11 May 2022
Buildings | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Monte Carlo tree search control scheme for multibody dynamics applications

Abstract

Talk to us

Similar Papers

More From: Nonlinear Dynamics