AI RESEARCH

Tailoring Teaching to Aptitude: Direction-Adaptive Self-Distillation for LLM Reasoning

arXiv CS.AI

ArXi:2605.22263v1 Announce Type: cross On-policy self-distillation (OPSD) is an emerging LLM post-