AI RESEARCH
ADWIN: Adaptive Windows for Horizon-Aware On-Policy Distillation
arXiv CS.AI
•
ArXi:2605.28396v1 Announce Type: cross On-policy distillation (OPD) transfers reasoning behavior by