AI RESEARCH
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning
arXiv CS.AI
•
ArXi:2602.01058v2 Announce Type: replace-cross Post-