AI RESEARCH
Bridging Reasoning Trajectories in On-Policy Distillation via Near-Future Guidance
arXiv CS.AI
•
ArXi:2606.00305v1 Announce Type: cross On-Policy Distillation (OPD) improves large language model reasoning by