AI RESEARCH

Embodied3DBench: Benchmarking Low-Level Embodied Spatial Intelligence of Vision Language Models

arXiv CS.CV • May 29, 2026

ArXi:2605.29074v1 Announce Type: new Are current Vision Language Models (VLMs) ready to comprehend and reason about complex embodied interactions in 3D environments? We

Read Full Article