Real-Time Symbolic Dynamic Programming

Luis Vianna,Leliane De Barros,Scott Sanner

doi:10.1609/aaai.v29i1.9651

Abstract

Recent advances in Symbolic Dynamic Programming (SDP) combined withthe extended algebraic decision diagram (XADD) have provided exactsolutions for expressive subclasses of finite-horizon Hybrid MarkovDecision Processes (HMDPs) with mixed continuous and discrete stateand action parameters. Unfortunately, SDP suffers from two majordrawbacks: (1) it solves for all states and can be intractable formany problems that inherently have large optimal XADD value functionrepresentations; and (2) it cannot maintain compact (pruned) XADDrepresentations for domains with nonlinear dynamics and reward due tothe need for nonlinear constraint checking. In this work, wesimultaneously address both of these problems by introducing real-timeSDP (RTSDP). RTSDP addresses (1) by focusing the solution and valuerepresentation only on regions reachable from a set of initial statesand RTSDP addresses (2) by using visited states as witnesses ofreachable regions to assist in pruning irrelevant or unreachable(nonlinear) regions of the value function. To this end, RTSDP enjoysprovable convergence over the set of initial states and substantialspace and time savings over SDP as we demonstrate in a variety of hybrid domains ranging from inventory to reservoir to traffic control.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Real-Time Symbolic Dynamic Programming

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: Mar 4, 2015
Citations: 4

Similar Papers

Algorithms by Design: Part III—A Novel Normalized Time Weighted Residual Methodology and Design of Optimal Symplectic-Momentum Based Controllable Numerical Dissipative Algorithms for Nonlinear Structural Dynamics
S Masuri ... K K Tamma
International Journal for Computational Methods in Engineering Science and Mechanics | VOL. 10
S Masuri, et. al.S Masuri ... K K Tamma
12 Feb 2009
International Journal for Computational Methods in Engineering Science and Mechanics | VOL. 10

Eye-hand coordination all the way: from discrete to continuous hand movements.
Adrien Coudiere ... Frederic R Danion
Journal of neurophysiology | VOL. 131
Adrien Coudiere, et. al.Adrien Coudiere ... Frederic R Danion
21 Feb 2024
Journal of neurophysiology | VOL. 131

Action decoupled SAC reinforcement learning with discrete-continuous hybrid action spaces
Yahao Xu ... Hongbin Deng
Neurocomputing | VOL. 537
Yahao Xu, et. al.Yahao Xu ... Hongbin Deng
31 Mar 2023
Neurocomputing | VOL. 537

Cooperative offensive decision-making for soccer robots based on bi-channel Q-value evaluation MADDPG
Lingli Yu ... Kaijun Zhou
Engineering Applications of Artificial Intelligence | VOL. 121
Lingli Yu, et. al.Lingli Yu ... Kaijun Zhou
21 Feb 2023
Engineering Applications of Artificial Intelligence | VOL. 121

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Real-Time Symbolic Dynamic Programming

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence