AI RESEARCH

Understanding Goal Generalisation in Sequential Reinforcement Learning

arXiv CS.AI

arXiv:2605.23565v1 Announce Type: cross Reinforcement learning agents often exhibit unintended goal-directed behaviour outside their