Skill Reuse as Compression in Agentic RL

ArXi:2605.31509v1 Announce Type: cross Large language model agents trained with reinforcement learning (RL) often learn brittle, task-specific shortcuts. We hypothesize that agents generalize better when their successful trajectories are structurally compressible, decomposed into a small set of reusable abstract patterns. To formalize this, we