AI RESEARCH

SANA-Streaming: Real-time Streaming Video Editing with Hybrid Diffusion Transformer

arXiv CS.AI

ArXi:2605.30409v1 Announce Type: cross Real-time streaming video-to-video editing (V2V) is critical for interactive applications such as live broadcasting and gaming, yet it remains a formidable challenge due to the stringent requirements for temporal consistency and inference throughput. In this paper, we present SANA-Streaming, a system-algorithm co-designed framework for high-resolution, real-time streaming video editing on consumer GPUs, with the following three core designs: (1) Hybrid Diffusion Transformer architecture.