AI RESEARCH
Large Language Models Hack Rewards, and Society
arXiv CS.AI
•
ArXi:2606.04075v1 Announce Type: cross Reinforcement learning (RL) has become a dominant post-