AI RESEARCH

Robust Reasoning via Dynamic Token Selection for Distribution-Aligned Self-Distillation

arXiv cs.CL

arXiv:2606.00628v1 Announce Type: new Self-distillation improves learning efficiency by rewriting reference answers as