AI RESEARCH
Decomposed On-Policy Distillation for Vision-Language Reasoning: Steering Gradients for Visual Grounding
arXiv CS.CL
•
ArXi:2606.00564v1 Announce Type: cross While on-policy distillation offers dense supervision for