Formal Specification and Testing for Reinforcement Learning

Mahsa Varshosaz, Andrzej Wąsowski, Einar Broch Johnsen, Mohsen Ghaffari

doi:10.1145/3607835

Abstract

The development process for reinforcement learning applications is still exploratory rather than systematic. This exploratory nature reduces reuse of specifications between applications and increases the chances of introducing programming errors. This paper takes a step towards systematizing the development of reinforcement learning applications. We introduce a formal specification of reinforcement learning problems and algorithms, with a particular focus on temporal difference methods and their definitions in backup diagrams. We further develop a test harness for a large class of reinforcement learning applications based on temporal difference learning, including SARSA and Q-learning. The entire development is rooted in functional programming methods: it starts with pure specifications and denotational semantics, and ends with property-based testing that uses compositional interpreters for a domain-specific term language as a test oracle for concrete implementations. We demonstrate the usefulness of this testing method on a number of examples, and evaluate it with mutation testing. We show that our test suite is effective in killing mutants (90% of mutants killed for 75% of subject agents). More importantly, almost half of all mutants are killed by generic write-once-use-everywhere tests that apply to any reinforcement learning problem modeled using our library, without any additional effort from the programmer.
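To make the idea of generic, problem-independent tests for temporal difference agents more concrete, the following is a minimal sketch in Python. It is not the authors' library or test harness: the function names (`td_update`, `sarsa_update`, `q_learning_update`, `check_generic_properties`), the dictionary-based Q-table, and the three properties checked are all assumptions chosen for illustration, and the random-input loop only imitates property-based testing using the standard library.

```python
import random


def td_update(q, s, a, r, target_value, alpha, gamma):
    """Return a new Q-table with one temporal-difference backup applied.

    The table is a dict keyed by (state, action); the input is not mutated.
    """
    new_q = dict(q)
    old = q[(s, a)]
    new_q[(s, a)] = old + alpha * (r + gamma * target_value - old)
    return new_q


def sarsa_update(q, s, a, r, s2, a2, alpha, gamma):
    # SARSA backs up the value of the action actually taken in the next state.
    return td_update(q, s, a, r, q[(s2, a2)], alpha, gamma)


def q_learning_update(q, s, a, r, s2, actions, alpha, gamma):
    # Q-learning backs up the best estimated value in the next state.
    best = max(q[(s2, b)] for b in actions)
    return td_update(q, s, a, r, best, alpha, gamma)


def check_generic_properties(trials=200, seed=0):
    """Check illustrative properties on randomly generated inputs."""
    rng = random.Random(seed)
    states, actions = range(4), range(3)
    for _ in range(trials):
        q = {(s, a): rng.uniform(-1, 1) for s in states for a in actions}
        s, s2 = rng.choice(states), rng.choice(states)
        a = rng.choice(actions)
        r = rng.uniform(-1, 1)
        alpha, gamma = rng.uniform(0, 1), rng.uniform(0, 1)

        new_q = q_learning_update(q, s, a, r, s2, actions, alpha, gamma)

        # Property 1: only the visited (state, action) entry may change.
        assert all(new_q[k] == q[k] for k in q if k != (s, a))

        # Property 2: a learning rate of zero leaves the table unchanged.
        assert q_learning_update(q, s, a, r, s2, actions, 0.0, gamma) == q

        # Property 3: SARSA with a greedy next action agrees with Q-learning.
        greedy = max(actions, key=lambda b: q[(s2, b)])
        assert sarsa_update(q, s, a, r, s2, greedy, alpha, gamma) == new_q
    return True
```

Calling `check_generic_properties()` runs the three checks on 200 random tables. None of the properties mentions a particular environment, which is the sense in which such tests could be written once and reused; a mutant that, for example, updated the wrong table entry would violate the first property.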

Journal: Proceedings of the ACM on Programming Languages
Publication Date: Aug 30, 2023
License type: CC BY

Similar Papers

Empirical Studies in Action Selection with Reinforcement Learning
Shimon Whiteson ... Peter Stone
Adaptive Behavior | VOL. 15
01 Mar 2007

Gradient temporal-difference learning algorithms
01 Jan 2010

Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning
Kristopher De Asis ... Silviu Pitis
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 34
03 Apr 2020

Three-link planar arm control using reinforcement learning
Wonchul Kim ... Sungwan Kim
01 Jun 2017
