Out of Sight, Not Out of Mind: Unveiling Latent Attack in Latent-based Multi-Agent Systems

ArXi:2605.28214v1 Announce Type: cross Latent-based multi-agent systems replace parts of explicit inter-agent communication with hidden representations, offering a new direction for efficient and flexible agent collaboration. However, moving coordination into latent space may also move attacks beyond the reach of visible-text inspection. In this paper, we study whether latent states can carry attack-associated information that remains effective during clean executions. To examine this question, we