AI RESEARCH
Training-Trajectory-Aware Token Selection
arXiv CS.CL
•
ArXi:2601.10348v2 Announce Type: replace Efficient distillation is a key pathway for converting expensive reasoning capability into deployable efficiency, yet in the frontier regime where the student already has strong reasoning ability, naive continual distillation often yields limited gains or even degradation. We observe a characteristic