AI RESEARCH
Trust Region On-Policy Distillation
arXiv CS.LG
•
ArXi:2606.01249v1 Announce Type: new On-Policy Distillation (OPD) is a fundamental technique for efficient post-