AI RESEARCH
Q-GeoMem: Question-Guided Geometric Memory for Video Spatial Reasoning
arXiv CS.CV
•
ArXi:2605.27318v1 Announce Type: new Video spatial reasoning requires accumulating viewpoint-dependent evidence over time while retaining information useful to the question being asked. Existing spatial video-language models improve geometric perception and long-range context modeling, but often treat memory as a generic temporal cache, which can