Reward Augmentation in Reinforcement Learning for Testing Distributed Systems

Andrea Borgarelli, Rupak Majumdar, Constantin Enea, Srinidhi Nagendra

doi:10.1145/3689779

Abstract

Bugs in widely used distributed protocol implementations have caused many outages in popular internet services. We describe a randomized testing approach for distributed protocol implementations based on reinforcement learning. Since the natural reward structure is very sparse, the key to successful exploration in reinforcement learning is reward augmentation. We show two techniques that build on one another. First, we provide a decaying exploration bonus based on the discovery of new states: the reward decays as the same state is visited multiple times. The exploration bonus captures the intuition from coverage-guided fuzzing of prioritizing new coverage points; in contrast to other schemes, we show that taking the maximum of the bonus and the Q-value leads to more effective exploration. Second, we provide waypoints to the algorithm as a sequence of predicates that capture interesting semantic scenarios. Waypoints exploit designer insight about the protocol and guide the exploration to "interesting" parts of the state space. Our reward structure ensures that new episodes can reliably reach deep, interesting states even without execution caching. We have implemented our algorithm in Go. Our evaluation on three large benchmarks (RedisRaft, Etcd, and RSL) shows that our algorithm can significantly outperform baseline approaches in terms of coverage and bug finding.
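The first technique can be illustrated with a small tabular sketch in Go. This is not the authors' implementation: the abstract does not give the decay schedule or the exact update rule, so the `1/(n+1)` bonus, the `Explorer` type, and the way the maximum is taken inside the Q-learning target are all assumptions chosen for illustration. States and actions are plain strings here, whereas a real tester would abstract them from the protocol under test.

```go
package main

import "fmt"

// Explorer is a hypothetical tabular agent used only for illustration.
type Explorer struct {
	visits map[string]int                // how often each state was reached
	q      map[string]map[string]float64 // Q-values indexed by state, then action
	alpha  float64                       // learning rate
	gamma  float64                       // discount factor
}

func NewExplorer(alpha, gamma float64) *Explorer {
	return &Explorer{
		visits: map[string]int{},
		q:      map[string]map[string]float64{},
		alpha:  alpha,
		gamma:  gamma,
	}
}

// Bonus is an assumed decay schedule: 1 for an unseen state,
// shrinking as 1/(n+1) after n visits.
func (e *Explorer) Bonus(state string) float64 {
	return 1.0 / float64(e.visits[state]+1)
}

// Q returns the current estimate for (state, action); unseen pairs are 0.
func (e *Explorer) Q(state, action string) float64 {
	return e.q[state][action]
}

// maxQ returns the largest Q-value recorded for state, or 0 if none exists.
func (e *Explorer) maxQ(state string) float64 {
	best, seen := 0.0, false
	for _, v := range e.q[state] {
		if !seen || v > best {
			best, seen = v, true
		}
	}
	return best
}

// Update performs one learning step for the transition (s, a) -> next.
// Assumption: the target is the maximum of the bonus of next and the
// discounted Q-value of next, rather than their sum.
func (e *Explorer) Update(s, a, next string) {
	target := e.Bonus(next)
	if future := e.gamma * e.maxQ(next); future > target {
		target = future
	}
	if e.q[s] == nil {
		e.q[s] = map[string]float64{}
	}
	e.q[s][a] += e.alpha * (target - e.q[s][a])
	e.visits[next]++ // the bonus for next decays on later visits
}

func main() {
	e := NewExplorer(0.5, 0.9)
	for i := 0; i < 3; i++ {
		e.Update("s0", "deliver", "s1")
		fmt.Println(e.Q("s0", "deliver"), e.Bonus("s1"))
	}
}
```

In this sketch, repeated visits to `s1` shrink its bonus, so the estimate for the action leading there falls over time unless `s1` itself leads somewhere valuable, in which case the discounted Q-value takes over as the target.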

More From: Proceedings of the ACM on Programming Languages