AI RESEARCH
Video Reasoning without Training
arXiv CS.AI
•
ArXi:2510.17045v2 Announce Type: replace-cross Video reasoning using Large Multimodal Models (LMMs) relies on costly reinforcement learning (RL) and verbose chain-of-thought, resulting in substantial computational overhead during both