AI RESEARCH

Pave-GRPO: Beyond Instantaneous Guidance through Principled Average Velocity Decomposition

arXiv CS.CV

Post-