AI RESEARCH
Robust Reasoning via Dynamic Token Selection for Distribution-Aligned Self-Distillation
arXiv cs.CL
arXiv:2606.00628v1 Announce Type: new Self-distillation improves learning efficiency by rewriting reference answers as