Infra-Bayesian Reinforcement Learning Agents Outperform Classical RL For Worst-Case Robustness

ArXi:2605.23146v1 Announce Type: cross Classical reinforcement learning assumes the agent interacts with a fixed environment whose behavior does not depend on the agent's policy. This assumption breaks down in non-realizable settings where other actors might anticipate the agent's behavior, including environments crucial to AI safety, where the agent interacts with predictors, humans, other AI agents, and institutions. In such settings, the agent's model class fails to capture the world in which it operates.