Multi-agent deep reinforcement learning (MADRL) has shown great potential in solving complex cooperation and competition tasks among agents. Algorithms based on the Actor-Critic architecture have demonstrated excellent performance in continuous action spaces. However, the robustness of such algorithms against adversarial attacks has not been thoroughly investigated. Existing studies focus primarily on either perceptual attacks or decisional attacks in isolation, without considering the impact of combining the two. In this paper, we propose the Perceptual-Decisional Joint Attack (PDJA), a framework that induces a strong synergistic disruption in MADRL systems. The framework executes a sequential, two-stage attack: (1) it first perturbs the agent’s perception using the actor’s gradients; (2) then, based on the agent’s reaction to this perturbed state, it attacks the agent’s decision, using the critic’s Q-value to apply a final perturbation directly to the output action. We evaluate our attack framework in the Multi-Agent Particle Environment (MPE) and Multi-Agent MuJoCo (MAMuJoCo), demonstrating the synergistic effect of jointly disrupting agent perception and decision-making, as well as the limitations of current defense strategies against such joint attacks.
multi-agent deep reinforcement learning; adversarial attack; adversarial robustness; perceptual-decisional joint attack
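
The following is a minimal sketch of the two-stage attack described in the abstract, assuming PyTorch and a deterministic Actor-Critic agent. The FGSM-style single gradient step, the random start, the epsilon budgets, and the critic(obs, action) call signature are illustrative assumptions rather than the paper's exact procedure.

import torch
import torch.nn.functional as F

def pdja_attack(actor, critic, obs, eps_obs=0.05, eps_act=0.05):
    """Return (perturbed observation, perturbed action) for one agent.

    Sketch only: attack objectives and step sizes are assumptions.
    """
    clean_action = actor(obs).detach()

    # Stage 1 (perceptual): perturb the observation with the actor's
    # gradient so the policy's output drifts from its clean action.
    # A small random start keeps the gradient nonzero at the first step.
    delta = eps_obs * torch.empty_like(obs).uniform_(-1, 1)
    obs_adv = (obs + delta).detach().requires_grad_(True)
    loss = F.mse_loss(actor(obs_adv), clean_action)
    loss.backward()
    obs_adv = obs_adv + eps_obs * obs_adv.grad.sign()              # ascend the loss
    obs_adv = obs + torch.clamp(obs_adv - obs, -eps_obs, eps_obs)  # project to budget
    obs_adv = obs_adv.detach()

    # Stage 2 (decisional): given the agent's reaction to the perturbed
    # observation, push the output action down the critic's Q-gradient
    # to minimize the agent's estimated return.
    act_adv = actor(obs_adv).detach().requires_grad_(True)
    q_value = critic(obs_adv, act_adv)   # assumed critic(obs, act) signature
    q_value.sum().backward()
    act_adv = (act_adv - eps_act * act_adv.grad.sign()).detach()
    return obs_adv, act_adv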