AI RESEARCH
Constitutional On-Policy Safe Distillation
arXiv cs.LG
arXiv:2606.03089v1 Announce Type: new On-policy self-distillation (OPSD) has emerged as an efficient post-