Risk-Aware Continuous Control with Neural Contextual Bandits

Jose A Ayala-Romero, Andres Garcia-Saavedra, Xavier Costa-Perez

doi:10.1609/aaai.v38i19.30083

Abstract

Recent advances in learning techniques have garnered attention for their applicability to a diverse range of real-world sequential decision-making problems. Yet many practical applications have critical constraints for operation in real environments. Most learning solutions neglect the risk of failing to meet these constraints, hindering their implementation in real-world contexts. In this paper, we propose a risk-aware decision-making framework for contextual bandit problems, accommodating constraints and continuous action spaces. Our approach employs an actor multi-critic architecture, with each critic characterizing the distribution of performance and constraint metrics. Our framework is designed to cater to various risk levels, effectively balancing constraint satisfaction against performance. To demonstrate the effectiveness of our approach, we first compare it against state-of-the-art baseline methods in a synthetic environment, highlighting the impact of intrinsic environmental noise across different risk configurations. We then evaluate our framework in a real-world use case involving a 5G mobile network, where only our approach consistently satisfies the system constraint (a signal processing reliability target) with a small performance toll (8.5% increase in power consumption).
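The idea of critics that describe a distribution, combined with a tunable risk level, can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract describes a learned actor, whereas the sketch below simply searches a fixed list of candidate actions. The function names, the toy critics, the quantile-based constraint check, and all numeric values are assumptions made for illustration only.

```python
import math


def quantile(sorted_values, q):
    """Linear-interpolation quantile of an already sorted list, 0 <= q <= 1."""
    pos = q * (len(sorted_values) - 1)
    lo, hi = math.floor(pos), math.ceil(pos)
    frac = pos - lo
    return sorted_values[lo] * (1 - frac) + sorted_values[hi] * frac


def select_action(candidates, reward_critic, cost_critic, cost_limit, risk_level):
    """Return the candidate with the best mean predicted reward whose predicted
    cost, taken at quantile `risk_level`, does not exceed `cost_limit`.

    `reward_critic` and `cost_critic` are hypothetical callables that map an
    action to a list of predicted values (a discretised distribution).
    Returns None when no candidate satisfies the constraint.
    """
    best_action, best_reward = None, -math.inf
    for action in candidates:
        cost_q = quantile(sorted(cost_critic(action)), risk_level)
        if cost_q > cost_limit:
            continue  # constraint violated at this risk level
        rewards = reward_critic(action)
        mean_reward = sum(rewards) / len(rewards)
        if mean_reward > best_reward:
            best_action, best_reward = action, mean_reward
    return best_action


# Toy stand-ins for trained critics (assumed, not from the paper):
# larger actions yield more reward but a wider, higher cost distribution.
def toy_reward_critic(action):
    return [action - 0.1, action, action + 0.1]


def toy_cost_critic(action):
    return [0.5 * action, 1.0 * action, 1.5 * action]


CANDIDATES = [0.0, 0.2, 0.4, 0.6, 0.8, 1.0]

# Median-based check (risk_level=0.5) versus worst-case check (risk_level=1.0).
neutral = select_action(CANDIDATES, toy_reward_critic, toy_cost_critic, 0.7, 0.5)
averse = select_action(CANDIDATES, toy_reward_critic, toy_cost_critic, 0.7, 1.0)
```

In this toy setting, checking the constraint at a higher quantile of the cost distribution yields a more conservative action, which mirrors the trade-off between constraint satisfaction and performance that the abstract describes.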

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Similar Papers

Maximum Entropy Exploration in Contextual Bandits with Neural Networks and Energy Based Models
Adam Elwood ... Alessandro Rozza
Entropy (Basel, Switzerland) | VOL. 25
18 Jan 2023

The impact of environmental noise on song amplitude in a territorial bird
Henrik Brumm
Journal of Animal Ecology | VOL. 73
16 Apr 2004

Reinforcement Learning for Bandits with Continuous Actions and Large Context Spaces
Paul Duckworth ... Bruno Lacerda
28 Sep 2023

The impact of environmental noise on robot-assisted laparoscopic surgical performance
Ka-Chun Siu ... Nick Stergiou
Surgery | VOL. 147
30 Oct 2009
