Avoiding Negative Side Effects of Autonomous Systems in the Open World

Sandhya Saisubramanian,Ece Kamar,Shlomo Zilberstein

doi:10.1613/jair.1.13581

Abstract

Autonomous systems that operate in the open world often use incomplete models of their environment. Model incompleteness is inevitable due to the practical limitations in precise model specification and data collection about open-world environments. Due to the limited fidelity of the model, agent actions may produce negative side effects (NSEs) when deployed. Negative side effects are undesirable, unmodeled effects of agent actions on the environment. NSEs are inherently challenging to identify at design time and may affect the reliability, usability and safety of the system. We present two complementary approaches to mitigate the NSE via: (1) learning from feedback, and (2) environment shaping. The solution approaches target settings with different assumptions and agent responsibilities. In learning from feedback, the agent learns a penalty function associated with a NSE. We investigate the efficiency of different feedback mechanisms, including human feedback and autonomous exploration. The problem is formulated as a multi-objective Markov decision process such that optimizing the agent’s assigned task is prioritized over mitigating NSE. A slack parameter denotes the maximum allowed deviation from the optimal expected reward for the agent’s task in order to mitigate NSE. In environment shaping, we examine how a human can assist an agent, beyond providing feedback, and utilize their broader scope of knowledge to mitigate the impacts of NSE. We formulate the problem as a human-agent collaboration with decoupled objectives. The agent optimizes its assigned task and may produce NSE during its operation. The human assists the agent by performing modest reconfigurations of the environment so as to mitigate the impacts of NSE, without affecting the agent’s ability to complete its assigned task. We present an algorithm for shaping and analyze its properties. Empirical evaluations demonstrate the trade-offs in the performance of different approaches in mitigating NSE in different settings.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Artificial Intelligence Research	Publication Date: May 10, 2022
Citations: 1	License type: cc-by

R Discovery Prime

R Discovery Prime

Avoiding Negative Side Effects of Autonomous Systems in the Open World

Abstract

Talk to us

Similar Papers

More From: Journal of Artificial Intelligence Research

Lead the way for us

Similar Papers

A Multi-Objective Approach to Mitigate Negative Side Effects
Sandhya Saisubramanian ... Ece Kamar
-
Sandhya Saisubramanian, et. al.Sandhya Saisubramanian ... Ece Kamar
01 Jul 2020
01 Jul 2020

Understanding User Attitudes Towards Negative Side Effects of AI Systems
Sandhya Saisubramanian ... Shannon C Roberts
-
Sandhya Saisubramanian, et. al.Sandhya Saisubramanian ... Shannon C Roberts
08 May 2021
08 May 2021

Planning and Learning for Non-markovian Negative Side Effects Using Finite State Controllers
Aishwarya Srivastava ... Sandhya Saisubramanian
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 37
Aishwarya Srivastava, et. al.Aishwarya Srivastava ... Sandhya Saisubramanian
26 Jun 2023
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 37

Avoiding Negative Side Effects due to Incomplete Knowledge of AI Systems
Sandhya Saisubramanian ... Shlomo Zilberstein
AI Magazine | VOL. 42
Sandhya Saisubramanian, et. al.Sandhya Saisubramanian ... Shlomo Zilberstein
01 Dec 2021
AI Magazine | VOL. 42

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Avoiding Negative Side Effects of Autonomous Systems in the Open World

Abstract

Talk to us

Similar Papers

More From: Journal of Artificial Intelligence Research