AI RESEARCH
SAW-Bench: Learning Situated Awareness in the Real World
arXiv CS.CV
•
ArXi:2602.16682v2 Announce Type: replace A core aspect of human perception is situated awareness, the ability to relate ourselves to the surrounding physical environment and reason over possible actions in context. However, most existing benchmarks for multimodal foundation models (MFMs) emphasize environment-centric spatial relations (relations among objects in a scene), while largely overlooking observer-centric relationships that require reasoning relative to agent's viewpoint, pose, and motion. To bridge this gap, we