Abstract

We propose a model-based reinforcement learning method that uses Monte Carlo Tree Search planning. The approach assumes a black-box approximate model of the environment, developed by an expert using any kind of modeling framework, and improves that model as new information is collected from the environment. This is crucial in real-world applications, since complete knowledge of complex environments is impractical. The expert's model is first translated into a neural network, which is then updated periodically with data, i.e., state-action-next-state triplets, collected from the real environment. We propose three different methods for integrating data acquired from the environment with the prior knowledge provided by the expert, and we evaluate our approach on a domain concerning air quality and thermal comfort control in smart buildings. We compare the three proposed versions with standard Monte Carlo Tree Search planning using the expert's model (without adaptation), Proximal Policy Optimization (a popular model-free deep reinforcement learning approach), and Stochastic Lower Bounds Optimization (a popular model-based approach). Results show that our approach outperforms all the analyzed competitors.
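The core loop described above, refining an approximate dynamics model from state-action-next-state triplets so a planner can use it, can be illustrated with a minimal sketch. All names here are hypothetical: the "model" is a simple linear predictor fit by least squares, whereas the paper's method uses a neural network translated from the expert's model.

```python
import numpy as np

rng = np.random.default_rng(0)

# Ground-truth (unknown) environment dynamics: s' = A s + B a.
A_true = np.array([[0.9, 0.1], [0.0, 0.8]])
B_true = np.array([[0.5], [1.0]])

def env_step(s, a):
    """Simulates one transition of the real environment."""
    return A_true @ s + B_true @ a

# Collect state-action-next-state triplets from the "real" environment.
states = rng.normal(size=(200, 2))
actions = rng.normal(size=(200, 1))
next_states = np.array([env_step(s, a) for s, a in zip(states, actions)])

# Update the approximate model from the collected triplets:
# next_state ≈ [state, action] @ W, fit by least squares.
X = np.hstack([states, actions])  # shape (200, 3)
W, *_ = np.linalg.lstsq(X, next_states, rcond=None)

def model_predict(s, a):
    """The learned model a planner (e.g., MCTS rollouts) would query."""
    return np.concatenate([s, a]) @ W

s0 = np.array([1.0, -0.5])
a0 = np.array([0.3])
residual = np.max(np.abs(model_predict(s0, a0) - env_step(s0, a0)))
print(residual)  # near zero, since the true dynamics are linear here
```

In the paper's setting this update step would adjust neural-network weights rather than solve a least-squares problem, and the refreshed model would be handed back to the Monte Carlo Tree Search planner between episodes.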
