Abstract
We consider online planning in Markov decision processes (MDPs). In online planning, the agent focuses on its current state only, deliberates about the set of possible policies from that state onwa...
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have