A framework for safe decision making: A convex duality approach

Martino Bernasconi,Matteo Castiglioni,Federico Cacciamani

doi:10.3233/ia-230008

Abstract

We study the problem of online interaction in general decision making problems, where the objective is not only to find optimal strategies, but also to satisfy certain safety guarantees, expressed in terms of costs accrued. In particular, we focus on the online learning problem in which an agent has to find the optimal solution of a linear objective. Moreover, the agent has to satisfy a linear safety constraint at each round. We propose a theoretical framework to address such problems and present BAN-SOLO, a UCB-like algorithm that, in an online interaction with an unknown environment, attains sublinear regret of order O ( T ) and satisfies a safety constraint with high probability at each iteration. BAN-SOLO provides a general framework that can be applied to any setting in which estimators of the objective and the cost function are available. At its core, it relies on tools from convex duality to manage environment exploration while satisfying the safety constraint imposed by the problem. To show the applicability of our framework, we provide two game theoretical applications: normal-form games and sequential decision-making problems.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A framework for safe decision making: A convex duality approach

Abstract

Talk to us

Similar Papers

More From: Intelligenza Artificiale

Lead the way for us

Similar Papers

On the correctness of monadic backward induction
Nuria Brede ... Nicola Botta
Journal of Functional Programming | VOL. 31
Nuria Brede, et. al.Nuria Brede ... Nicola Botta
01 Jan 2020
Journal of Functional Programming | VOL. 31

Safe Online Convex Optimization with Unknown Linear Safety Constraints
Sapana Chaudhary ... Dileep Kalathil
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 36
Sapana Chaudhary, et. al.Sapana Chaudhary ... Dileep Kalathil
28 Jun 2022
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 36

A Framework for UAV Navigation and Exploration in GPS-Denied Environments
Fernando Vanegas ... Jonathan Roberts
-
Fernando Vanegas, et. al.Fernando Vanegas ... Jonathan Roberts
01 Mar 2019
01 Mar 2019

Safe Linear Thompson Sampling With Side Information
Ahmadreza Moradipari ... Sanae Amani
IEEE Transactions on Signal Processing | VOL. 69
Ahmadreza Moradipari, et. al.Ahmadreza Moradipari ... Sanae Amani
01 Jan 2020
IEEE Transactions on Signal Processing | VOL. 69

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A framework for safe decision making: A convex duality approach

Abstract

Talk to us

Similar Papers

More From: Intelligenza Artificiale