AI RESEARCH
Physical Object Understanding with a Physically Controllable World Model
arXiv cs.CV
arXiv:2606.00439v1 Announce Type: new
A central challenge in visual intelligence is learning the physical structure of scenes from raw videos: how regions form objects and which laws govern their interactions. Solving these tasks requires world models capable of inferring distributional states of the world from partial observations, capabilities that current architectures do not provide. We