Plan-based reward shaping for multi-agent reinforcement learning

Sam Devlin,Daniel Kudenko

doi:10.1017/s0269888915000181

Plan-based reward shaping for multi-agent reinforcement learning

Sam Devlin, Daniel Kudenko

Open Access

https://doi.org/10.1017/s0269888915000181

Copy DOI

Journal: The Knowledge Engineering Review	Publication Date: Jan 1, 2016
Citations: 21

#Reward Shaping #Multi-agent Reinforcement Learning + Show 8 more

Abstract
Full-Text
Similar Papers

Abstract

AbstractRecent theoretical results have justified the use of potential-based reward shaping as a way to improve the performance of multi-agent reinforcement learning (MARL). However, the question remains of how to generate a useful potential function.Previous research demonstrated the use of STRIPS operator knowledge to automatically generate a potential function for single-agent reinforcement learning. Following up on this work, we investigate the use of STRIPS planning knowledge in the context of MARL.Our results show that a potential function based on joint or individual plan knowledge can significantly improve MARL performance compared with no shaping. In addition, we investigate the limitations of individual plan knowledge as a source of reward shaping in cases where the combination of individual agent plans causes conflict.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: The Knowledge Engineering Review

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.