AI RESEARCH

Trust Region On-Policy Distillation

arXiv CS.LG

ArXi:2606.01249v1 Announce Type: new On-Policy Distillation (OPD) is a fundamental technique for efficient post-