AI RESEARCH
LongCat-Video-Avatar 1.5 Technical Report
arXiv CS.CV
•
ArXi:2605.26486v1 Announce Type: new Despite advances in audio-driven video generation, achieving commercial-grade stability remains challenging. We present LongCat-Video-Avatar 1.5, an upgraded open-source framework prioritizing systematic engineering and production-readiness over architectural novelty. By upgrading the audio encoder to Whisper Large and meticulously scaling our