NVIDIA Drops a Model “LocateAnything”

Towards AI
AI Hardware

LocateAnything with Parallel Box Decoding Turns Visual Grounding Into an Agent Primitive