AI RESEARCH

MIND: Multi-Scale Intent Diffusion for Text-Driven Physics-Based Humanoid Control

arXiv CS.CV

ArXi:2605.26006v2 Announce Type: replace Enabling physics-based humanoids to execute diverse behaviors from high-level textual commands remains a significant challenge. Existing methods typically follow either a two-stage paradigm that combines kinematic motion generation with physics-based tracking, or an end-to-end imitation-learning paradigm that directly generates actions from text.