AI RESEARCH

Large Language Models Hack Rewards, and Society

arXiv CS.AI

ArXi:2606.04075v1 Announce Type: cross Reinforcement learning (RL) has become a dominant post-