AI RESEARCH

Constitutional On-Policy Safe Distillation

arXiv cs.LG

arXiv:2606.03089v1 Announce Type: new On-policy self-distillation (OPSD) has emerged as an efficient post-