AI RESEARCH
ROGLE: Robust Global-Local Alignment with Automated Region Supervision for Text-Based Person Search
arXiv cs.CV
arXiv:2606.01825v1 Announce Type: new Text-Based Person Search (TBPS) aims to retrieve pedestrian images using natural language queries. However, existing TBPS models, especially those based on CLIP, struggle with fine-grained understanding due to global representational bias and semantic sparsity inherited from