Understanding Sample Generation Strategies for Learning Heuristic Functions in Classical Planning

Rafael V Bettker,Marcus Ritt,André G Pereira,Pedro P Minini

doi:10.1613/jair.1.15742

Abstract

We study the problem of learning good heuristic functions for classical planning tasks with neural networks based on samples represented by states with their cost-to-goal estimates. The heuristic function is learned for a state space and goal condition with the number of samples limited to a fraction of the size of the state space, and must generalize well for all states of the state space with the same goal condition. Our main goal is to better understand the influence of sample generation strategies on the performance of a greedy best-first heuristic search (GBFS) guided by a learned heuristic function. In a set of controlled experiments, we find that two main factors determine the quality of the learned heuristic: the algorithm used to generate the sample set and how close the sample estimates to the perfect cost-to-goal are. These two factors are dependent: having perfect cost-to-goal estimates is insufficient if the samples are not well distributed across the state space. We also study other effects, such as adding samples with high-value estimates. Based on our findings, we propose practical strategies to improve the quality of learned heuristics: three strategies that aim to generate more representative states and two strategies that improve the cost-to-goal estimates. Our practical strategies result in a learned heuristic that, when guiding a GBFS algorithm, increases by more than 30% the mean coverage compared to a baseline learned heuristic.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Understanding Sample Generation Strategies for Learning Heuristic Functions in Classical Planning

Abstract

Talk to us

Similar Papers

More From: Journal of Artificial Intelligence Research

Lead the way for us

Journal: Journal of Artificial Intelligence Research	Publication Date: Jun 2, 2024
License type: cc-by

Similar Papers

The Fast Downward Planning System
M Helmert
Journal of Artificial Intelligence Research | VOL. 26
M HelmertM Helmert
12 Jul 2006
Journal of Artificial Intelligence Research | VOL. 26

A Novel Technique for Avoiding Plateaus of Greedy Best-First Search in Satisficing Planning
Tatsuya Imai ... Akihiro Kishimoto
Proceedings of the International Symposium on Combinatorial Search | VOL. 2
Tatsuya Imai, et. al.Tatsuya Imai ... Akihiro Kishimoto
19 Aug 2021
Proceedings of the International Symposium on Combinatorial Search | VOL. 2

Policy-Guided Heuristic Search with Guarantees
Laurent Orseau ... Levi H S Lelis
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35
Laurent Orseau, et. al.Laurent Orseau ... Levi H S Lelis
18 May 2021
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35

Ant-TD: Ant colony optimization plus temporal difference reinforcement learning for multi-label feature selection
Mohsen Paniri ... Mohammad Bagher Dowlatshahi
Swarm and Evolutionary Computation | VOL. 64
Mohsen Paniri, et. al.Mohsen Paniri ... Mohammad Bagher Dowlatshahi
27 Apr 2021
Swarm and Evolutionary Computation | VOL. 64

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Understanding Sample Generation Strategies for Learning Heuristic Functions in Classical Planning

Abstract

Talk to us

Similar Papers

More From: Journal of Artificial Intelligence Research