Improved POMDP Tree Search Planning with Prioritized Action Branching

John Mern,Lawrence Bush,Mykel J Kochenderfer,Anil Yildiz,Tapan Mukerji

doi:10.1609/aaai.v35i13.17412

Improved POMDP Tree Search Planning with Prioritized Action Branching

John Mern, Lawrence Bush + Show 3 more

Open Access

PDF Available

https://doi.org/10.1609/aaai.v35i13.17412

Copy DOI

Export

Save

Cite

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: May 18, 2021
Citations: 6

Affiliation: Stanford University, General Motors (Poland)

#Large Action Spaces #Action Space #Expected Information Gain #Tree Expansion #Search Tree #Large Spaces #Highest Score #Proposed Method #Discrete Spaces #Discrete Action Spaces

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

Online solvers for partially observable Markov decision processes have difficulty scaling to problems with large action spaces. This paper proposes a method called PA-POMCPOW to sample a subset of the action space that provides varying mixtures of exploitation and exploration for inclusion in a search tree. The proposed method first evaluates the action space according to a score function that is a linear combination of expected reward and expected information gain. The actions with the highest score are then added to the search tree during tree expansion. Experiments show that PA-POMCPOW is able to outperform existing state-of-the-art solvers on problems with large discrete action spaces.

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.

R Discovery Prime

Improved POMDP Tree Search Planning with Prioritized Action Branching