AI RESEARCH
CogVLA: Cognition-Aligned Vision-Language-Action Model via Instruction-Driven Routing & Sparsification
arXiv CS.CV
arXiv:2508.21046v3 Announce Type: replace Recent Vision-Language-Action (VLA) models built on pre-trained Vision-Language Models (VLMs) require extensive post-