AI RESEARCH

Decomposed On-Policy Distillation for Vision-Language Reasoning: Steering Gradients for Visual Grounding

arXiv CS.CL

arXiv:2606.00564v1 Announce Type: cross While on-policy distillation offers dense supervision for