A Pilot Study of Observation Poisoning on Selective Reincarnation in Multi-Agent Reinforcement Learning

Harsha Putla,Chanakya Patibandla,Krishna Pratap Singh,P Nagabhushan

doi:10.1007/s11063-024-11625-w

Abstract

This research explores the vulnerability of selective reincarnation, a concept in Multi-Agent Reinforcement Learning (MARL), in response to observation poisoning attacks. Observation poisoning is an adversarial strategy that subtly manipulates an agent’s observation space, potentially leading to a misdirection in its learning process. The primary aim of this paper is to systematically evaluate the robustness of selective reincarnation in MARL systems against the subtle yet potentially debilitating effects of observation poisoning attacks. Through assessing how manipulated observation data influences MARL agents, we seek to highlight potential vulnerabilities and inform the development of more resilient MARL systems. Our experimental testbed was the widely used HalfCheetah environment, utilizing the Independent Deep Deterministic Policy Gradient algorithm within a cooperative MARL setting. We introduced a series of triggers, namely Gaussian noise addition, observation reversal, random shuffling, and scaling, into the teacher dataset of the MARL system provided to the reincarnating agents of HalfCheetah. Here, the “teacher dataset” refers to the stored experiences from previous training sessions used to accelerate the learning of reincarnating agents in MARL. This approach enabled the observation of these triggers’ significant impact on reincarnation decisions. Specifically, the reversal technique showed the most pronounced negative effect for maximum returns, with an average decrease of 38.08% in Kendall’s tau values across all the agent combinations. With random shuffling, Kendall’s tau values decreased by 17.66%. On the other hand, noise addition and scaling aligned with the original ranking by only 21.42% and 32.66%, respectively. The results, quantified by Kendall’s tau metric, indicate the fragility of the selective reincarnation process under adversarial observation poisoning. Our findings also reveal that vulnerability to observation poisoning varies significantly among different agent combinations, with some exhibiting markedly higher susceptibility than others. This investigation elucidates our understanding of selective reincarnation’s robustness against observation poisoning attacks, which is crucial for developing more secure MARL systems and also for making informed decisions about agent reincarnation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Pilot Study of Observation Poisoning on Selective Reincarnation in Multi-Agent Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: Neural Processing Letters

Lead the way for us

Journal: Neural Processing Letters	Publication Date: May 2, 2024
License type: CC BY 4.0

Similar Papers

Optimal bidding strategy for the price-maker virtual power plant in the day-ahead market based on multi-agent twin delayed deep deterministic policy gradient algorithm
Yuzheng Jiang ... Hexiang Huang
Energy | VOL. 306
Yuzheng Jiang, et. al.Yuzheng Jiang ... Hexiang Huang
11 Jul 2024
Energy | VOL. 306

An Improved DDPG Algorithm with Barrier Function for Lane-Change Decision-Making of Intelligent Vehicles
Tianshuo Feng ... Xiaochuan Zhang
-
Tianshuo Feng, et. al.Tianshuo Feng ... Xiaochuan Zhang
01 Jan 2020
01 Jan 2020

Deep Reinforcement Learning for Ecological and Distributed Urban Traffic Signal Control with Multi-Agent Equilibrium Decision Making
Liping Yan ... Jing Wang
Electronics | VOL. 13
Liping Yan, et. al.Liping Yan ... Jing Wang
13 May 2024
Electronics | VOL. 13

Multi-Agent Confrontation Game Based on Multi-Agent Reinforcement Learning
Shuo Han ... Liangjun Ke
-
Shuo Han, et. al.Shuo Han ... Liangjun Ke
15 Oct 2021
15 Oct 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Pilot Study of Observation Poisoning on Selective Reincarnation in Multi-Agent Reinforcement Learning

Abstract

Talk to us

Similar Papers

More From: Neural Processing Letters