Geometry-Aware Implicit Memory for Video World Models
arXiv CS.CV
arXiv:2606.02436v1 Announce Type: new
Video world models aim to simulate controllable visual environments, but long-horizon rollouts depend on what the model remembers after observations leave its native context window. Explicit memories retain frames or online 3D reconstructions, which can suffer from heuristic retrieval errors, redundant appearance storage, or reconstruction artifacts. Implicit memories compress history into a compact state, but existing designs are not explicitly constrained to encode cross-view scene geometry.