AI RESEARCH

GIRL-DETR: Gradient-Isolated Reinforcement Learning for Video Moment Retrieval

arXiv CS.AI

ArXi:2606.00775v1 Announce Type: cross Video Moment Retrieval (VMR) task requires accurately localizing temporal boundaries aligned with natural language queries, but many models suffer from a misalignment between continuous surrogate losses and non-differentiable metrics, leading to optimization stagnation during the late stages of