AI RESEARCH
Tailoring Teaching to Aptitude: Direction-Adaptive Self-Distillation for LLM Reasoning
arXiv CS.AI
•
ArXi:2605.22263v1 Announce Type: cross On-policy self-distillation (OPSD) is an emerging LLM post-