AI RESEARCH

Bridging Reasoning Trajectories in On-Policy Distillation via Near-Future Guidance

arXiv cs.AI

arXiv:2606.00305v1 Announce Type: cross On-Policy Distillation (OPD) improves large language model reasoning by