On safe sequential optimization using posterior sampling

Pratik Kar, Vineeth Bala Sukumaran, S Sumitra

doi:10.1109/spcom55316.2022.9840822

Abstract

We consider the problem of designing posterior-sampling-based sequential optimization policies for maximizing a black-box function subject to safety constraints. Posterior sampling algorithms, which are relatively easy to implement, have met with empirical success in black-box maximization problems without safety constraints. We ask whether posterior sampling algorithms that satisfy safety constraints can perform well at reaching the global maximum while minimizing the number of safety constraint violations. We propose a safe Gaussian process Thompson Sampling algorithm for safe maximization of a black-box function. The algorithm uses a sample estimate of the safe set to meet the safety constraints, and a mutual-information-based acquisition function to improve that estimate. We evaluate the proposed policy against prior work using simulations. We observe that the proposed policy behaves similarly to prior work in terms of safety violations while achieving the global maximum.
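As a rough illustration of the idea of picking queries from a sampled safe set, the sketch below runs one step of Gaussian process Thompson sampling on a 1-D grid. It is not the authors' algorithm. It assumes that "safe" means the function value stays at or above a known threshold, it uses a fixed RBF kernel, and it leaves out the mutual-information acquisition step entirely. All names (`rbf_kernel`, `gp_posterior`, `safe_thompson_step`, `threshold`) and all parameter values are hypothetical.

```python
import numpy as np


def rbf_kernel(a, b, lengthscale=0.2, variance=1.0):
    """Squared-exponential kernel between two sets of 1-D points."""
    d = a[:, None] - b[None, :]
    return variance * np.exp(-0.5 * (d / lengthscale) ** 2)


def gp_posterior(x_obs, y_obs, x_grid, noise=1e-2):
    """Posterior mean and covariance of a zero-mean GP evaluated on x_grid."""
    K = rbf_kernel(x_obs, x_obs) + noise * np.eye(len(x_obs))
    Ks = rbf_kernel(x_grid, x_obs)
    Kss = rbf_kernel(x_grid, x_grid)
    L = np.linalg.cholesky(K)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y_obs))
    V = np.linalg.solve(L, Ks.T)
    return Ks @ alpha, Kss - V.T @ V


def safe_thompson_step(x_obs, y_obs, x_grid, threshold, rng):
    """Draw one posterior sample, treat the points where the sample is at or
    above `threshold` as the sampled safe set (an assumption of this sketch),
    and return the maximizer of the sample over that set."""
    mean, cov = gp_posterior(x_obs, y_obs, x_grid)
    cov = cov + 1e-8 * np.eye(len(x_grid))  # jitter for numerical stability
    sample = rng.multivariate_normal(mean, cov)
    safe_idx = np.flatnonzero(sample >= threshold)
    if safe_idx.size == 0:
        # Sampled safe set is empty: fall back to the best point seen so far.
        return x_obs[np.argmax(y_obs)]
    return x_grid[safe_idx[np.argmax(sample[safe_idx])]]


# Toy usage: the objective and the safe seed points are made up for illustration.
rng = np.random.default_rng(0)
objective = lambda x: np.sin(6.0 * x)
x_grid = np.linspace(0.0, 1.0, 101)
x_obs = np.array([0.2, 0.3])  # assumed known-safe starting points
y_obs = objective(x_obs)
for _ in range(10):
    x_next = safe_thompson_step(x_obs, y_obs, x_grid, threshold=0.0, rng=rng)
    x_obs = np.append(x_obs, x_next)
    y_obs = np.append(y_obs, objective(x_next))
```

Because the safe set is read off a single posterior sample, a step can still query a point that is truly unsafe. The abstract therefore frames the goal as keeping violations few, and this sketch makes no safety guarantee of its own.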


