AI RESEARCH
B-GRTO: Bootstrapped Group Relative Tool Optimization for Referring Segmentation
arXiv CS.LG
•
ArXi:2605.23500v1 Announce Type: cross Segmentation is a fundamental task in computer vision, underpinning pixel-level scene understanding and serving as a cornerstone for applications ranging from autonomous perception to medical image analysis. For complex referring segmentation, recent methods pair large vision-language models with segmentation decoders: the former analyzes the image and prompt, while the latter predicts the target mask.