AI RESEARCH

Geo-Align: Video Generation Alignment via Metric Geometry Reward

arXiv CS.CV

ArXi:2605.23903v1 Announce Type: new Camera-controlled video generation has achieved remarkable progress in recent years. However, existing video-to-video re-rendering methods primarily rely on Supervised Fine-Tuning using synthetic datasets. At present, there is an extreme scarcity of synchronized, multi-view real-world video data. Consequently, the prevailing paradigm often exhibits limited generalization when processing out-of-distribution real-world videos, with models struggling to accurately adhere to physical scales and camera trajectories.