ROGUE: Misaligned Agent Behavior Arising from Ordinary Computer Use

arXiv:2606.00341v1 Announce Type: cross As AI agents are increasingly deployed in real personal and corporate settings (email accounts, development workflows, company databases, etc.), safety considerations surrounding these agents become paramount. Although much work has focused on agent safety in the presence of an adversary, we show that agents can exhibit misaligned behavior even in benign settings, taking unsafe actions when those actions are instrumental to task completion.