AI RESEARCH
Learning Action-Conditional and Object-Centric Gaussian Splatting World Models for Rigid Objects
arXiv CS.LG
•
ArXi:2606.01950v1 Announce Type: cross World models enable intelligent agents to predict the consequences of their actions on the environment. In this paper, we propose Multi Rigid Object Gaussian World Model (MRO-GWM), a novel model that learns action-conditional dynamics of rigid objects in 3D. By representing the scene by object-centric Gaussians, we can represent arbitrary object shapes and multi-object scenes. We develop a novel spatio-temporal transformer architecture that predicts future rigid body motion from a history of object Gaussians and future actions.