AI RESEARCH

Voluntary Collusion with Secret Tools in Competing LLM Agents

arXiv CS.AI

ArXi:2605.27593v1 Announce Type: new Even when a tool is explicitly described as unfair and harmful to others, ostensibly safety-aligned LLM agents still voluntarily engage in secret collusion whenever doing so confers a strategic advantage. To investigate this phenomenon, we