Abstract
Dealing with aleatoric uncertainty is key in many domains involving sequential decision making, e.g., planning in AI, network protocols, and symbolic program synthesis. This paper presents a general-purpose, model-based framework that synthesises policies for uncertain environments in a fully automated manner. The new concept of coloured Markov Decision Processes (MDPs) enables a succinct representation of a wide range of synthesis problems: a coloured MDP describes a collection of possible policy configurations together with their structural dependencies. The framework covers the synthesis of (a) programmatic policies from probabilistic program sketches and (b) finite-state controllers representing policies for partially observable MDPs (POMDPs), including decentralised as well as constrained POMDPs. We show that all these synthesis problems can be cast as exploring memoryless policies in the corresponding coloured MDP. This exploration rests on a symbiosis of two orthogonal techniques: abstraction refinement, using a novel refinement method, and counterexample generalisation. Our approach outperforms dedicated synthesis techniques on some problems and significantly improves an earlier version of this framework.
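To make the central notion concrete, below is a minimal, hypothetical Python sketch of a coloured MDP as described in the abstract: each nondeterministic choice is guarded by "colours" (hole–option pairs), and fixing one option per hole selects a single policy configuration. All names (ColouredMDP, Colour, Choice, holes, configurations, induced_choices) and the exact data layout are illustrative assumptions for exposition, not the paper's actual formalisation or API.

```python
from dataclasses import dataclass
from itertools import product

@dataclass(frozen=True)
class Colour:
    """A (hole, option) pair guarding a choice -- an assumption for this sketch."""
    hole: str    # name of a synthesis parameter ("hole")
    option: int  # one admissible value for that hole

@dataclass
class Choice:
    action: str
    colours: frozenset  # colours that must all agree with the configuration
    transitions: dict   # successor state -> probability

@dataclass
class ColouredMDP:
    states: list
    choices: dict  # state -> list[Choice]
    holes: dict    # hole name -> list of admissible options

    def configurations(self):
        """Enumerate all policy configurations (the design space)."""
        names = list(self.holes)
        for options in product(*(self.holes[h] for h in names)):
            yield dict(zip(names, options))

    def induced_choices(self, assignment, state):
        """Choices of `state` enabled under a configuration: a choice
        survives iff every one of its colours matches the assignment."""
        return [c for c in self.choices[state]
                if all(assignment[col.hole] == col.option
                       for col in c.colours)]

if __name__ == "__main__":
    # Toy design space: two holes with two options each -> four configurations.
    cmdp = ColouredMDP(
        states=["s0"],
        choices={"s0": [
            Choice("a", frozenset({Colour("H1", 0)}), {"s0": 1.0}),
            Choice("b", frozenset({Colour("H1", 1)}), {"s0": 1.0}),
        ]},
        holes={"H1": [0, 1], "H2": [0, 1]},
    )
    for cfg in cmdp.configurations():
        print(cfg, [c.action for c in cmdp.induced_choices(cfg, "s0")])
```

Note that this sketch only enumerates configurations one by one; the framework in the paper instead reasons about whole sets of configurations at once via abstraction refinement, and uses counterexample generalisation to prune many configurations per analysis step.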