AI RESEARCH

minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models

arXiv CS.CV

arXiv:2605.30263v1 Announce Type: new

Recent video diffusion foundation models have made remarkable progress in high-quality video generation, yet turning them into real-time interactive video world models remains challenging. Interactive world models require controllable, causal, and low-latency rollout, which in practice demands a full pipeline spanning data construction, controllable fine-tuning, autoregressive