AI RESEARCH

Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning

arXiv CS.AI

arXiv:2602.01058v2 Announce Type: replace-cross Post-